INTRODUCTION


A wide range of subjects has been treated in this contract. We have devoted time to
the development and applications of two first-principles computational methods: one, the
full-potential linear muffin tin orbital (FP-LMTO) method, is somewhat mature and highly accurate,
while the other, linear combination of atomic orbitals (LCAO), is less accurate but more flexible
and is easily incorporated into the other calculations we have in place, e.g., surface Green’s
function methods and CPA. Tight binding has also been used.

These methods have been applied to solve a host of mechanical-property problems
including elastic constants, cleavage energies, sublimation energies, interactions between surface
atoms relating to their surface order-disorder state and growth theory, surface segregation, bulk
order-disorder theory and phase stability, the effect of dislocations on electronic transport and
electro-optic properties of semiconductors, the Ni-Al intermetallic phase diagram, planar fault
energies in L1₂ alloys, high-performance structural metal alloy design, and a contribution to
understanding the Jones theory of metal alloying. Many of these subjects have been brought to
publishable conclusions. Whenever possible, we have presented our detailed results in the form
of preprints and reprints, with only brief summaries of the work given here. In instances where
the research is incomplete, we have given somewhat longer expositions.

2. SEMICONDUCTORS


2.1  DEVELOPMENT  OF  AB  INITIO  THEORIES 

2.1.1  KKR  Theory 

Many of our previous calculations were based on the empirical tight-binding method
(ETB). This method is most useful as an interpolation scheme in which the properties of
reference materials are well known. The accuracy is expected to deteriorate if ETB is used to
extrapolate too far from known systems. For example, ETB should be reliable for alloy systems
because the end points of the alloys, namely the constituent compounds, are well described by
ETB and the alloy structures are very similar to those of the hosts. However, we are not as
confident regarding surface calculations because the broken bonds at the surface are very different
from the tetrahedral bonds of the bulk. For surface problems, an ab initio approach is more
suitable.

In addition to the full-potential LMTO method which has been established at SRI (see
Section 2.1.3), efforts have been made to develop an ab initio LCAO scheme and a
multiple-scattering KKR (MSKKR) Green’s function theory (Yeh et al., 1990; Appendix A) using
space-filling potentials. The reason for studying LCAO is that it can be transformed into the ETB form
for which we have developed many useful techniques that can be employed directly, such as the
CPA and the difference equation approach (see Section 2.3.1). Similarly, once we can deal with
nonspherical potentials in Green’s function formalism, we can extend the KKR-CPA and defect
calculations using self-consistent potentials to semiconductors and alloys.

A numerical test of the MSKKR theory using the Mathieu potential shows excellent
convergence for bands up to 1.5 Ry (Yeh et al., 1990). This method has been implemented in a total
energy calculation for silicon and III-V compounds within the Harris functional approximation.
Results from the first successful test of the LCAO on diatomic molecules have been obtained.
The extension to bulk semiconductors using a limited basis and the Harris approximation has also
been obtained. To make these approaches applicable to surfaces, more work incorporating
self-consistency with an expanded basis set is needed.

2.1.2  Linear  Combination  Of  Atomic  Orbitals:  Theory 

The empirical band structures, even when they are carefully constructed for the cases
studied, are limited in versatility and accuracy, while most of the existing first-principles band
structure methods are computationally too demanding for the study of many realistic problems.
The best first-principles theory that is also numerically tractable is given by the local density
approximation, or LDA (Hohenberg and Kohn, 1964; Kohn and Sham, 1965; Baraff and Schluter, 1983).

One of the widely used methods to solve LDA equations is based on self-consistent
pseudopotentials with plane-wave-basis expansions (Ihm and Cohen, 1980; Nielson and Martin,
1985; Ismail et al., 1991). The electronic and structural properties such as total energies, lattice
constants, bulk moduli, and phonon frequencies are often in excellent agreement with
experiments. Although this method has a built-in simplicity in constructing the Hamiltonian, the
matrix size can be unmanageably large in studies, for example, of superlattices, defects, surfaces,
and high-field transport that involve a large number of atoms. Other accurate methods that use
linearized muffin-tin orbitals (Andersen, Jepsen, and Sob, 1987; van Schilfgaarde and
Methfessel, 1990) and linearized augmented plane waves (Zunger and Freeman, 1977; Mbaye,
Ferreira, and Zunger, 1987) will also be time consuming, particularly in the studies listed above.

One way to overcome this hurdle is to use the linear combination of atomic orbitals
(LCAO) method in a tight-binding (TB) form. This first-principles tight-binding (FPTB)
Hamiltonian uses pseudo-atomic orbitals as the basis for a self-consistently determined
Hamiltonian matrix. In a simple application, these orbitals are computed self-consistently from a
free-atom calculation within LDA and used in the solid with a non-self-consistent Harris
approximation (Harris, 1985). This method has been shown to be computationally fast and precise
enough to be useful for a variety of problems (Jensen and Sankey, 1987, 1989; Sankey and
Niklewski, 1989; Chelikowsky and Louie, 1984; Vanderbilt and Louie, 1984). A major
advantage of the LCAO method over other methods is its availability in the familiar TB form, which is
well suited to all the tasks studied here. In particular, incorporation of this Hamiltonian in
the Green’s function approach, whether to study surface-driven ordering, to obtain coherent
potential approximation (CPA) limited alloy band structures, or to study the effect of impurities
and defects on band structures, is straightforward.

2.1.2.1 LCAO Band Structures

In this section, we briefly describe the LCAO method. Details will be published elsewhere.
By assuming only four orbitals (sp3) per atom, we studied the band structures and
structural properties of Group IV elements and their alloys, and of III-V compounds. Readily
available pseudopotentials are used within LDA in the Kohn-Sham equation for individual
atoms. A wave function obtained in this way at a site $\ell$ is denoted as $|\alpha \ell j\rangle$, where $\alpha$ is
$s$, $p_x$, $p_y$, or $p_z$, and $j$ is anion or cation. In the solid, the wave functions, which are assumed to be
a linear combination of these atomic wave functions, satisfy the Kohn-Sham (KS) eigenvalue equation

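$$\left[-\tfrac{1}{2}\nabla^2 + V_{\rm core} + V_{\rm nonloc} + V_{\rm coul} + \mu_{xc}\right]\psi_i = \epsilon_i\,\psi_i \qquad (1)$$

where $i$ is the band index. The Coulomb and exchange-correlation (xc) terms depend on the input
electron density, $\rho_{\rm in}(\mathbf{r})$, and the output charge density is $\rho_{\rm out} = 2\sum_i |\psi_i|^2$. The equation is
solved for self-consistency (i.e., $\rho_{\rm in} = \rho_{\rm out}$); $\mu_{xc}$ is the total derivative of $\rho$ times $\epsilon_{xc}$ with
respect to $\rho$. Then the total energy is

$$E_{\rm tot} = 2\sum_i \epsilon_i + \frac{1}{2}{\sum_{j,j'}}' \frac{Z_j Z_{j'}}{|\mathbf{R}_j - \mathbf{R}_{j'}|} - \frac{1}{2}\int d\mathbf{r}\,\rho(\mathbf{r})\,V_{\rm coul}(\mathbf{r}) + \int d\mathbf{r}\,\rho(\mathbf{r})\left[\epsilon_{xc}(\rho) - \mu_{xc}(\rho)\right] . \qquad (2)$$

When the lattice periodicity exists, a Bloch state for a given crystal momentum, $\mathbf{k}$, is constructed
as

$$|\alpha \mathbf{k} j\rangle = \frac{1}{\sqrt{N}}\sum_{\ell} e^{i\mathbf{k}\cdot\mathbf{R}_\ell}\,|\alpha \ell j\rangle , \qquad (3)$$

where the sum runs over the $N$ lattice sites $\ell$ at positions $\mathbf{R}_\ell$. In addition, we expand the Bloch
states in terms of plane waves, and we get

$$|\alpha \mathbf{k} j\rangle = \sum_{\mathbf{G}} c_{\mathbf{k}}(\mathbf{k}+\mathbf{G})\,|\mathbf{k}+\mathbf{G}\rangle . \qquad (4)$$

From Eqs. (3) and (4), we get the coefficients $c_{\mathbf{k}}(\mathbf{k}+\mathbf{G})$ in terms of the Fourier coefficients of
the $\alpha$ orbital of the $j$ atom. Once we obtain the Fourier transform of the kinetic energy and all
potentials of the KS equation, then a k-space 8x8 Hamiltonian is easily constructed.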
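
Because the atomic-orbital basis is not orthonormal (see Section 2.1.2.2), the band energies at each k follow from a generalized eigenvalue problem, H(k) c = ε S(k) c, rather than from a plain diagonalization. The following is a minimal sketch of that step only, assuming a Löwdin (symmetric) orthogonalization; the 2x2 matrices are toy numbers chosen for illustration, not the 8x8 matrices of the report, and the function name `solve_bands` is hypothetical.

```python
import numpy as np

def solve_bands(h_k, s_k):
    """Solve H(k) c = eps S(k) c for Hermitian H and positive-definite S."""
    # Lowdin orthogonalization: build S^(-1/2) from the eigen-decomposition of S.
    s_vals, s_vecs = np.linalg.eigh(s_k)
    s_inv_half = (s_vecs / np.sqrt(s_vals)) @ s_vecs.conj().T
    # Ordinary Hermitian eigenproblem in the orthogonalized basis.
    h_orth = s_inv_half @ h_k @ s_inv_half
    eps, c_orth = np.linalg.eigh(h_orth)
    # Back-transform the eigenvectors to the original nonorthogonal basis.
    coeffs = s_inv_half @ c_orth
    return eps, coeffs

# Toy 2x2 Hamiltonian and overlap matrices (illustrative values only).
h_k = np.array([[-1.0, -0.5], [-0.5, 0.5]])
s_k = np.array([[1.0, 0.3], [0.3, 1.0]])

eps, coeffs = solve_bands(h_k, s_k)
# Residual of the generalized eigenvalue equation, column by column.
residual = h_k @ coeffs - (s_k @ coeffs) * eps
```

The returned eigenvectors are normalized with respect to the overlap matrix, so that `coeffs.T @ s_k @ coeffs` is the identity for this real example.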

As a test of accuracy of the results obtained with this LCAO Hamiltonian, we initially
carried out the calculations with the assumption that the charge density in the solid is a
superposition of atomic charge densities. This approximation, commonly known as the Harris
approximation (Harris, 1985), removes the self-consistency loop and has been shown to yield reliable
trends in bulk semiconductors. In this approximation, the Coulomb potential, $V_{\rm coul}$, which
depends on the total charge density, also reduces to the sum of contributions from individual
atoms. However, the exchange-correlation term is the most difficult one because of its
complicated dependence on the charge density. A Fourier transform of the exchange-correlation
potential is obtained by direct space integration within a parallelepiped unit cell that contains two
atoms. Ten special k-points are used to calculate $E_{\rm tot}$ as given in Eq. (2).

To  study  problems  such  as  molecules,  defects,  and  surfaces,  it  is  necessary  to  have 
Hamiltonian  matrix  elements  connecting  any  two  orbitals  in  real  space.  For  a  periodic  lattice, 
such  a  matrix  can  be  obtained  by  simply  integrating  the  above-calculated  k-space  Hamiltonian 
(with an appropriate phase factor) over the Brillouin Zone (BZ). For a general case, we
constructed the real-space Hamiltonian by expanding off-site functions in spherical harmonics
defined  with  respect  to  a  fixed  origin.  All  integrals  are  then  carried  out  in  direct  space.  For  the 
case  with  lattice  periodicity,  Hamiltonians  obtained  with  these  two  methods  have  been  tested  and 
shown  to  agree  with  one  another. 
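
The Brillouin-zone integration with a phase factor can be illustrated on a one-dimensional chain. This is a sketch under simplifying assumptions (one orbital per site, nearest-neighbor hopping, a uniform k grid); the names `real_space_element`, `onsite`, and `hop` are hypothetical and the numbers are toy values, not results from the report.

```python
import numpy as np

def real_space_element(h_of_k, k_points, r):
    """Average H(k) * exp(-i k r) over a uniform Brillouin-zone grid."""
    phases = np.exp(-1j * k_points * r)
    return np.mean(h_of_k(k_points) * phases)

a = 1.0                     # lattice constant (arbitrary units)
onsite, hop = -2.0, -0.7    # toy on-site energy and nearest-neighbor hopping
n_k = 64
k_points = 2.0 * np.pi * np.arange(n_k) / (n_k * a)

def h_of_k(k):
    # Bloch sum of the chain: H(k) = sum_R exp(i k R) H(R).
    return onsite + 2.0 * hop * np.cos(k * a)

h0 = real_space_element(h_of_k, k_points, 0.0)     # recovers the on-site term
h1 = real_space_element(h_of_k, k_points, a)       # recovers the hopping
h2 = real_space_element(h_of_k, k_points, 2 * a)   # vanishes beyond first neighbors
```

In three dimensions the same average runs over a BZ mesh and over matrix-valued H(k), but the structure of the inverse transform is unchanged.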

Band structure and total energy calculations have been carried out for Group IV elements,
SiC, SiGe, and III-V compounds. Structural properties such as the equilibrium bond length, $d_0$;
cohesive energy per bond, $E_b$; and bulk modulus, $B_0$, are obtained.

The results obtained for silicon are in excellent agreement with experimental results. The
$d_0$ and $B_0$ were larger than the experimental values by 0.4% and 3%, respectively. The
calculated cohesive energy per atom, -5.20 eV, agrees well with experiment (-4.64 eV) after
subtracting the spin-polarization energy of -0.6 eV per atom. (The experimental ground state, as dictated
by Hund’s rule, has two p-electrons with the same spin, whereas the uncorrected calculations
consider p-electrons with opposite spin.)
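
The arithmetic behind this comparison can be written out explicitly. The reading assumed here is that the spin-polarization energy of the free atom is removed from the calculated value before it is compared with experiment; the symbols $E_{\rm coh}^{\rm calc}$ and $E_{\rm sp}$ are introduced only for this illustration.

```latex
E_{\rm coh}^{\rm corrected}
  = E_{\rm coh}^{\rm calc} - E_{\rm sp}
  = -5.20\ \mathrm{eV} - (-0.6\ \mathrm{eV})
  = -4.60\ \mathrm{eV},
\qquad
\left| -4.60\ \mathrm{eV} - (-4.64\ \mathrm{eV}) \right| = 0.04\ \mathrm{eV}.
```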

The results obtained for other Group IV elements, such as germanium and carbon, are in
reasonable agreement with experiments. For germanium, $d_0$ and $B_0$ differ from experiment by
0.4% and -11%, respectively, and $E_b$ is 2.36 eV, which is larger than the experimental value of
1.94 eV. For carbon, $d_0$ and $B_0$ differ from experiments by -2.6% and 10%, respectively, and $E_b$
is 3.99 eV, which is again larger than the experimental value of 3.68 eV.

Corresponding results for III-V compounds are considerably worse. In general, $d_0$ is
off by 2 to 8%, $B_0$ is off by 10 to 40%, and $E_b$ is deeper by 0.2 to 0.9 eV. Because of the charge
transfer from cation to anion, the wave functions in the solid are more localized than the
corresponding atomic wave functions, so linear combinations of atomic functions do not adequately
describe the wave function in the solid. To improve matters, the basis functions are made more
compact by multiplying with a function (van Schilfgaarde and Methfessel, 1990),

$$\frac{1}{e^{\beta (r - r_0)} + 1}$$

and renormalizing. The parameters $\beta$ and $r_0$ are then varied to obtain a local maximum in the
cohesive energy. As a general guiding rule, we start with twice the value of the Pauling radius for $r_0$.
Values of 1 and 0.5 for $\beta$ in carbon and germanium, respectively, give excellent structural
properties. Without further modifications, the total energy calculations are carried out for SiC and
SiGe compounds. The calculated excess energies that favor ordering in SiC and segregation in
SiGe alloys are in agreement (within a few millielectron volts) with experimental values.
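
The compacting step described above can be sketched numerically. This is an illustration only: the radial function is a toy exponential rather than a pseudo-atomic orbital, the grid and the values of `beta` and `r0` are arbitrary, and the helper names are hypothetical.

```python
import numpy as np

def integrate(values, grid):
    """Trapezoidal rule on a radial grid."""
    return float(np.sum(0.5 * (values[1:] + values[:-1]) * np.diff(grid)))

def compact_orbital(r, radial, beta, r0):
    """Multiply a radial function by 1/(exp(beta*(r - r0)) + 1), then renormalize."""
    cutoff = 1.0 / (np.exp(beta * (r - r0)) + 1.0)
    shaped = radial * cutoff
    norm = np.sqrt(integrate(shaped**2 * r**2, r))
    return shaped / norm

def mean_radius(r, radial):
    """Expectation value of r for a radial function (need not be normalized)."""
    return integrate(radial**2 * r**3, r) / integrate(radial**2 * r**2, r)

r = np.linspace(0.0, 20.0, 4001)
atomic = np.exp(-r)                                   # toy 1s-like radial function
compact = compact_orbital(r, atomic, beta=1.0, r0=2.0)

radius_before = mean_radius(r, atomic)
radius_after = mean_radius(r, compact)                # smaller: the orbital is more compact
```

In a full calculation, `beta` and `r0` would then be scanned and the total energy recomputed for each pair; that outer loop is not shown here.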

Although such a (maximizing) procedure improved the accuracy, the disagreement was
still very large for III-V compounds. We find that increasing $r_0$ and $\beta$ reduces both the
equilibrium bond length and the bulk modulus. However, when these parameters are varied slightly in
the neighborhood of these ‘equilibrium’ values, excellent fits to experimental values are obtained
for arsenide and phosphide compounds of gallium, aluminum, and indium.

2.1.2.2 Green’s Function

In all the studies we carried out previously, we used Green’s functions with the assumption
that the basis set is orthonormal. However, the LCAO band structure described in the previous
section has a basis set that is far from orthonormal. Hence, Green’s function calculations
have to be modified for the case of nonorthogonal basis orbitals. We define a Green’s function $G$,

$$(E + i\delta - H)\,G = 1 . \qquad (5)$$

When the basis set is orthogonal,

$$(E + i\delta)\,G_{\alpha\beta} - \sum_{\alpha'} H_{\alpha\alpha'}\,G_{\alpha'\beta} = \delta_{\alpha\beta} , \qquad (6)$$

then

$$G = \sum_i \frac{|\psi_i\rangle\langle\psi_i|}{E + i\delta - E_i} , \qquad (7)$$

where $H|\psi_i\rangle = E_i|\psi_i\rangle$. However, when the basis set is not orthonormal, Eq. (5) becomes

$$\sum_{\alpha'}\left[(E + i\delta)\,S_{\alpha\alpha'} - H_{\alpha\alpha'}\right] G^{m}_{\alpha'\beta} = S_{\alpha\beta} . \qquad (8)$$

It can be shown that all observables are related to $G^{m}$. For example, the density of states (DOS)
is

$$\rho(E) = -\frac{1}{\pi}\,\mathrm{Im}\sum_{\alpha} G^{m}_{\alpha\alpha} . \qquad (9)$$

Note that $G^{m}_{\alpha\beta}$ is not a simple extension of Eq. (7). Now consider a Green’s function $G^{k}$ that
satisfies

$$\sum_{\alpha'}\left[(E + i\delta)\,S_{\alpha\alpha'} - H_{\alpha\alpha'}\right] G^{k}_{\alpha'\beta} = \delta_{\alpha\beta} . \qquad (10)$$

Then

$$G^{k} = \frac{1}{ES - H + i\delta} , \qquad (11)$$

whose matrix elements are obtained as in Eq. (7). Now we can see from Eqs. (8) and (10) that

$$G^{m} = G^{k} S . \qquad (12)$$

Hence the prescription to calculate the Green’s function, when the basis set is nonorthogonal, is:

(a) Obtain the Green’s function in the usual way after replacing $E + i\delta$ by $ES + i\delta$.
This is known as the contravariant Green’s function, $G^{k}$.

(b) Then multiply $G^{k}$ by the overlap matrix $S$ to obtain the mixed-mode Green’s
function, $G^{m}$. The DOS is obtained from this Green’s function as shown in Eq. (9).
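
Steps (a) and (b) can be sketched with dense matrices. This is a minimal illustration, assuming small toy matrices and a direct matrix inverse in place of the layer-by-layer methods used in the report; the function names `mixed_green` and `dos` are hypothetical. The broadening is applied as (E + iδ)S, which differs from ES + iδ only by a term that vanishes as δ goes to zero.

```python
import numpy as np

def mixed_green(energy, h, s, delta=1e-2):
    """Return the contravariant G^k = [(E + i*delta) S - H]^(-1) and mixed G^m = G^k S."""
    g_k = np.linalg.inv((energy + 1j * delta) * s - h)
    return g_k, g_k @ s

def dos(energy, h, s, delta=1e-2):
    """Density of states from the trace of the mixed Green's function."""
    _, g_m = mixed_green(energy, h, s, delta)
    return -np.trace(g_m).imag / np.pi

# Toy Hamiltonian and overlap matrices (illustrative values only).
h = np.array([[-1.0, -0.5], [-0.5, 0.5]])
s = np.array([[1.0, 0.3], [0.3, 1.0]])

energy, delta = 0.2, 0.05
g_k, g_m = mixed_green(energy, h, s, delta)

# Cross-check: Tr G^m equals the sum over generalized eigenvalues of 1/(E + i*delta - eps_i).
eps = np.linalg.eigvals(np.linalg.solve(s, h))
trace_from_eigs = np.sum(1.0 / (energy + 1j * delta - eps))
```

The cross-check shows why the trace of the mixed Green’s function, and not that of the contravariant one, carries the correctly normalized DOS.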

After careful analysis, we are convinced that the difference equation method (see Section
2.3.1) we developed (Chen et al., 1989) can be modified to follow the above prescription. In
addition to the H matrix, we have to calculate inter- and intralayer overlap matrices. Because the
difference equation method remains valid, surface and interface Green’s functions are
obtained without any difficulty. However, we find that the term values and interaction matrix
elements of surface atoms are much different (by as much as 2 eV in some cases) from their
corresponding bulk values. In such cases, the surface DOS is obtained in two steps. First, by
assuming the same term values on the surface, we use the difference equation method to obtain $G^{k}$.
We then consider the difference in term values to obtain the final Green’s function in a Dyson’s
expansion. Again, nonorthogonality requires some generalization, and it has been included.

2.1.2.3 Generalized Perturbation Method

In the generalized perturbation method (GPM), the excess energy required to substitute
CPA atoms by the constituent atoms is calculated from the Green’s function obtained by Dyson’s
expansion (Turchi et al., 1987). Because of nonorthogonality, we must consider the change in
the overlap matrix, in addition to the change in the potential. Once we obtain $G^{k}$ with the
Dyson’s expansion, $G^{m}$ is obtained by multiplying with the appropriate $S$ matrix. The change in
the DOS and energy are obtained from $G^{m}$ in the usual way.
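
One way to see how the overlap change enters is to note that, for a perturbed pair (H + ΔH, S + ΔS), the contravariant Green’s function obeys a Dyson equation with the energy-dependent effective perturbation ΔH − zΔS, where z = E + iδ. The sketch below assumes this algebraic form, which follows from the defining equation of the contravariant Green’s function but is not necessarily how the report’s programs are organized; the matrices are toy values and the function name is hypothetical.

```python
import numpy as np

def dyson_contravariant(g0_k, delta_h, delta_s, z):
    """Solve G = G0 + G0 V G with V = delta_H - z * delta_S for the contravariant G."""
    v_eff = delta_h - z * delta_s
    n = g0_k.shape[0]
    return np.linalg.solve(np.eye(n) - g0_k @ v_eff, g0_k)

# Toy three-orbital reference system (illustrative values only).
h0 = np.array([[0.0, -1.0, 0.0], [-1.0, 0.0, -1.0], [0.0, -1.0, 0.0]])
s0 = np.array([[1.0, 0.2, 0.0], [0.2, 1.0, 0.2], [0.0, 0.2, 1.0]])

# Substitution on the first orbital: a shifted term value and a changed overlap.
delta_h = np.zeros((3, 3))
delta_h[0, 0] = 0.4
delta_s = np.zeros((3, 3))
delta_s[0, 1] = delta_s[1, 0] = 0.05

z = 0.3 + 0.05j
g0_k = np.linalg.inv(z * s0 - h0)
g_k = dyson_contravariant(g0_k, delta_h, delta_s, z)
g_m = g_k @ (s0 + delta_s)            # mixed Green's function uses the perturbed overlap

# Reference result from direct inversion of the perturbed problem.
g_k_direct = np.linalg.inv(z * (s0 + delta_s) - (h0 + delta_h))
```

Because the perturbation is confined to a few orbitals, the matrix that must be inverted in the Dyson step can in practice be restricted to that small block.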

The existing LCAO Hamiltonian works well for Group IV elements and alloys
(Krishnamurthy et al., 1992). However, the Hamiltonian for III-V compounds is better than the
usual empirical TB Hamiltonian only in the sense that it has fewer (two) parameters and the
spatial dependence of all matrix elements is known. The Harris approximation with only four
orbitals seems to break down for ionic compounds. For a truly first-principles LCAO
Hamiltonian, we must remove the Harris approximation and obtain results self-consistently.
Also, true atomic wave functions are more extended than required to describe the wave function
in the solid. A systematic development of pseudo-atomic wave functions with d-orbitals is
necessary for ionic compounds.

Our calculations of band structures suggest that interactions up to third neighbors should
be  included  for  reliable  results.  The  existing  Green’s  function  programs  consider  only  nearest 
bilayer  plane  interactions.  For  surfaces  oriented  in  the  (100)  and  (110)  directions,  this  does  not 
include  all  interactions  up  to  third  neighbors.  We  are  generalizing  the  method  by  increasing  the 
basic  unit  plane  to  include  four  monolayers.  The  ordering  induced  on  these  surfaces  can  then  be 
studied. 

Although the calculated valence bands agree very well with experiments, LDA does not
reproduce the band gap or some portions of the conduction bands accurately. The accurate
evaluation of surface states may require accurate conduction bands. With a self-energy-like
correction that contains the effect of exchange and correlation among electrons, known as the GW
approximation, an accurate band structure can be obtained (Hybertsen and Louie, 1986, 1987,
1988). This self-energy term, which is nonlocal, energy dependent, and non-Hermitian, is
evaluated from the dressed Green’s function, G, and the dynamical screening matrix.

2.1.3 Linear Muffin Tin Orbitals: Theory

In the past, all aspects of our Green’s function and CPA work used empirical tight-binding
methods. Although these methods are computationally easy to deal with, they suffer from a
lack of general applicability and in many cases result in large quantitative uncertainties.
However, much of our current and proposed work lies beyond the domain of simple tight-binding
Hamiltonians; thus, our recent efforts have centered strongly around the use of an ab initio
electronic structure method known as the linear muffin tin orbitals (LMTO) method. While we
have recently obtained a number of results with this method (several are described below), we
have until now mostly experimented with the various approximations within the method and
their limitations.

Density-functional methods are quite powerful and remarkably accurate for a wide range
of materials. Moreover, they are suitable for both electronic structure and mechanical properties.
For example, native defect equilibrium, which is an important feature in the performance of
HgCdTe-based focal plane arrays, has been studied in HgTe (Berding et al., 1992; Appendix B).
The chief drawback of these methods has been that they are quite “heavy”; that is, they are
difficult to implement and very slow to execute. The LMTO method uses an efficient basis set;
indeed, it is the most efficient of all electronic structure methods. Its primary drawback has lain
in the spherical approximations made to the potential that render it unsuitable for calculating
mechanical properties of structures of low symmetry. However, recent versions of the method
remove that limitation.

We  now  possess  three  versions  of  the  method. 

• The first uses the classical atomic spheres potential approximation but is highly
efficient; we have used it extensively in studies of, for example, Schottky barriers.

• The second uses a proper full-potential approximation, and thus solves the local-density
functional nearly exactly. This method is still heavy and cumbersome, but
is important to provide a reference for the correct local-density result.

• Finally, an efficient version of the second code implements the new “Harris-Foulkes”
method (e.g., Foulkes and Haydock, 1989). The central idea here is to
reevaluate the density functional at the input density and make a sufficiently good
guess for the density that it obviates the need for self-consistency. A superposition
of spherical (e.g., free-atomic) densities seems to work remarkably well in almost
all cases studied. Moreover, the resulting Hamiltonian looks quite similar to
empirical tight-binding Hamiltonians. Should this method prove to be sufficiently
accurate (as it seems to be), it should be an attractive alternative to the widely acclaimed
Car-Parrinello method (e.g., see Galli et al., 1989). The Harris functional, used in
conjunction with the LMTO method (or some comparably efficient method), will be
essential to the study of extended defects such as dislocations.

Figure 1 shows some results, in part to demonstrate the precision of the Harris method,
but also with a view toward computing results of interest to us. Figure 1(a) compares the
theoretical and experimental values of the lattice constant and cohesive energy; Figure 1(b) compares
the theoretical and experimental values of the elastic constant, $c_{11} - c_{12}$, and the bulk modulus,
B. Although the theoretical results calculated using the Harris method fare less well than do
fully self-consistent calculations, the agreement with experiment is still quite good. Note that
errors in the cohesive energy are due to problems with the local-density approximation, not due
to Harris.

Figure 2 is closer to the practical applications our contract is aimed at. We seek a
“supermodulus” effect in a thin superlattice of CdTe/ZnTe. In Figure 2, the superlattice consists of
alternating monolayers, along the <110> direction, of CdTe and ZnTe. Interestingly, the bulk
modulus is approximately linear (or sublinear) in the composition, while the shear modulus is
supralinear. This shear modulus corresponds approximately to the measurements of Young’s
modulus by Farthing et al. (private communication).


2.2  THEORY  OF  ELASTIC  CONSTANTS 

This work emphasizes three aspects: (1) analytical relations between elastic constants
and microscopic atomic quantities, (2) quantitative calculation of elastic properties, and
(3) extension to other mechanical properties. Most of our work concerns the first two aspects.
Harrison’s bond orbital model (BOM) provides a very good framework for the analytical work.
This model treats the diamond and zinc blende semiconductors on an equal footing. It does not
need to treat the Madelung energies separately for polar systems, as required in the valence force
field model. Although Harrison’s universal parameters give only a qualitative prediction, the
trends are correct. More important, these analytical expressions relate the three supposedly
independent elastic moduli by the following formula:

$$\frac{9}{C_{44}} = \frac{6}{C_{11} - C_{12}} + \frac{4}{B}$$

where B is the bulk modulus. A test using the experimental values shows that this formula is
satisfied to within 10% for all semiconductors. We have also modified the parameters to make
the BOM a quantitative model.

Figure 1. Cohesive energies and shear elastic coefficients:
comparison between theory and experiment

Figure 2. Calculation of shear modulus (circles) and bulk modulus (squares) versus composition x
for superlattices of CdZnTe along [110]. Shear compression is
normal to [110] planes, keeping lateral area fixed.
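
The relation among the three elastic moduli in this section can be checked numerically for a given material. The sketch below uses the cubic-crystal identity B = (C11 + 2 C12)/3; the elastic constants are approximate, commonly tabulated room-temperature values in GPa supplied here as assumptions for illustration, not data taken from the report, and the function name is hypothetical.

```python
def relation_mismatch(c11, c12, c44):
    """Relative mismatch between the two sides of 9/C44 = 6/(C11 - C12) + 4/B."""
    bulk = (c11 + 2.0 * c12) / 3.0          # bulk modulus of a cubic crystal
    lhs = 9.0 / c44
    rhs = 6.0 / (c11 - c12) + 4.0 / bulk
    return (rhs - lhs) / lhs

# Approximate elastic constants in GPa (assumed illustrative inputs).
elastic_constants = {
    "GaAs": (118.8, 53.8, 59.4),
    "Ge": (128.5, 48.3, 66.8),
}

mismatch = {name: relation_mismatch(*c) for name, c in elastic_constants.items()}
```

A negative mismatch means the right-hand side is smaller than 9/C44 for that set of inputs.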

For quantitative calculations, a tight-binding Hamiltonian similar to that used in BOM is
adopted. The difference is that the Hamiltonian is diagonalized exactly in this approach, whereas
it is treated perturbatively and locally in BOM. There are four adjustable parameters: two ($f$ and
$n$) for the first-neighbor tight-binding Hamiltonian matrix elements, $V = f V_0 (d_0/d)^n$; and two
($u_0$ and $m$) for the repulsive pair energy, $u = u_0 (d_0/d)^m$. Here, $V_0$ is Harrison’s universal matrix
element, $d$ is the distance between the nearest-neighbor atoms, and $d_0$ is its equilibrium value.
These four parameters ($f$, $n$, $u_0$, $m$) are adjusted so that the model best reproduces the experimental
values for the bond length, cohesive energy, bulk modulus, and $C_{11} - C_{12}$. These parameters
have been obtained for all common diamond and zinc blende semiconductors. The consistency
of the model has also been confirmed by its ability to predict other elastic properties such as $C_{44}$
and TO phonon frequencies (Yeh et al., 1991; Appendix C). The resulting Hamiltonian has also
been used in other studies (Sher et al., Appendix D).
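
The two power laws that carry the distance dependence of the model are simple to code. The sketch below only encodes the scaling forms quoted above and checks their logarithmic slopes; the parameter values are hypothetical placeholders, not the fitted values of the report, and the fitting procedure itself is not shown.

```python
import math

def hopping(d, d0, v0, f, n):
    """First-neighbor matrix element V = f * V0 * (d0/d)**n."""
    return f * v0 * (d0 / d) ** n

def repulsion(d, d0, u0, m):
    """Repulsive pair energy u = u0 * (d0/d)**m."""
    return u0 * (d0 / d) ** m

def log_slope(func, d, step=1e-4):
    """Numerical d ln|func| / d ln d; equals minus the exponent for a power law."""
    hi, lo = d * (1.0 + step), d * (1.0 - step)
    return (math.log(abs(func(hi))) - math.log(abs(func(lo)))) / (math.log(hi) - math.log(lo))

# Hypothetical parameter values, chosen only to exercise the functions.
hop_params = dict(d0=2.35, v0=-2.0, f=1.1, n=2.0)
rep_params = dict(d0=2.35, u0=3.0, m=4.0)

slope_v = log_slope(lambda d: hopping(d, **hop_params), 2.5)
slope_u = log_slope(lambda d: repulsion(d, **rep_params), 2.5)
```

The exponents $n$ and $m$ therefore control how strongly the attractive and repulsive terms respond to strain, which is why they are tied to the bulk modulus and $C_{11} - C_{12}$ in the fit.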

2.3 SURFACES, INTERFACES, AND SUPERLATTICES

2.3.1 Difference-Equation Approach

The difference-equation approach is a tight-binding (TB) based approach to layered
structures without the need for using supercells. This method takes full advantage of the repeated
units in these structures. The TB equations in the repeated regions are solved in terms of the
characteristic solutions to the difference equations, which relate the expansion coefficients of the
energy eigenfunction of a layer to those of the neighboring layers. The final problem is
reduced to a small set of boundary equations. This approach also facilitates the calculation of
Green’s functions. Some analytical results for one dimension have been obtained and the theory
has been generalized to three-dimensional systems. Several applications of this approach have
also been made (Chen et al., 1989; Schick et al., 1989, Appendix E; Chen et al., 1990; Berding et
al., 1990a).
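
A one-dimensional example shows the idea. For a semi-infinite chain with one orbital per site, the TB equations in the repeated region are (z − ε) c_n = t (c_{n+1} + c_{n−1}), and the trial solution c_n = λ^n gives the characteristic equation λ² − xλ + 1 = 0 with x = (z − ε)/t. The decaying root fixes the surface Green’s function as λ/t. The sketch below is a textbook illustration under these assumptions, not the report’s three-dimensional implementation; the names and numbers are hypothetical.

```python
import cmath

def surface_green_from_root(z, onsite, hop):
    """Surface Green's function of a semi-infinite chain from the decaying characteristic root."""
    x = (z - onsite) / hop
    root = cmath.sqrt(x * x - 4.0)
    lam_a = 0.5 * (x + root)
    lam_b = 0.5 * (x - root)
    # The two roots multiply to 1; keep the one that decays into the chain.
    lam = lam_a if abs(lam_a) < 1.0 else lam_b
    return lam / hop

def surface_green_by_recursion(z, onsite, hop, n_layers=2000):
    """Same quantity obtained by adding layers one at a time, for comparison."""
    g = 1.0 / (z - onsite)
    for _ in range(n_layers):
        g = 1.0 / (z - onsite - hop * hop * g)
    return g

z = 0.5 + 0.05j              # energy with a small positive imaginary part
g_root = surface_green_from_root(z, onsite=0.0, hop=1.0)
g_rec = surface_green_by_recursion(z, onsite=0.0, hop=1.0)
```

The characteristic-root route needs only the solution of a small algebraic problem, whereas the recursion must be iterated until it converges, which is the practical advantage of working with the difference equation directly.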

2.3.2  Cleavage  Energy 

One application of the difference-equation approach is the calculation of the surface and
cleavage energies in semiconductors. We applied this method to a representative group of
semiconductors including silicon, GaAs, CdTe, and HgTe, using a second-neighbor tight-binding
Hamiltonian. A Green’s function method was used which exploits the symmetry of the surface
problem and leads to a calculation of the surface energy directly, without requiring us to take the
difference between two large numbers. We find $E_\gamma(111) = 1360$ ergs/cm$^2$ in GaAs and
$E_\gamma(110) = 1000$, 180, and 120 ergs/cm$^2$ in GaAs, CdTe, and HgTe, respectively. These results
are in good agreement with the available experimental values. The results of this work have
been published in the Journal of Applied Physics (Berding et al., 1990a).
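
For readers more used to SI units, the surface energies quoted above convert as follows; this is a unit conversion only and adds no new data.

```latex
1\ \mathrm{erg/cm^2} = \frac{10^{-7}\ \mathrm{J}}{10^{-4}\ \mathrm{m^2}} = 10^{-3}\ \mathrm{J/m^2},
\qquad
1360\ \mathrm{erg/cm^2} = 1.36\ \mathrm{J/m^2},
\quad
1000\ \mathrm{erg/cm^2} = 1.00\ \mathrm{J/m^2},
\quad
180\ \mathrm{erg/cm^2} = 0.18\ \mathrm{J/m^2},
\quad
120\ \mathrm{erg/cm^2} = 0.12\ \mathrm{J/m^2}.
```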

The LCAO Green’s function developed above for the surface was first used to calculate
the cleavage energy for silicon, whose cleavage plane is (111). Excellent bulk
band structures of silicon were obtained using our FPTB Hamiltonian, which includes all interactions out to the
third neighbor. Incidentally, when the planar unit in the difference equation approach is
bilayered, all interactions up to third neighbor are included in the interplane Hamiltonian. In other
words, previously written Green’s function programs can be used with appropriate generalization
for nonorthogonality and without further modification. The cleavage energy is calculated as
described previously (Berding et al., 1990a). The calculated cleavage energy of 1260 ergs/cm$^2$
for silicon is in excellent agreement with experiment at 1140-1240 ergs/cm$^2$. Similarly, the
calculated energy for GaAs agrees well with the previously reported value (Berding et al., 1990a). However,
the (110) plane is the cleavage plane for GaAs, and such calculations have not yet been carried
out for that plane. This is because not all third-neighbor interactions are included in the
interbiplanar interaction for the (110) surface, and this requires some modification to existing Green’s
function programs. These modifications are under way.

2.3.3  Energetics  of  Atoms  on  Surfaces 

A  second-neighbor  tight-binding  model  and  the  Green’s  function  based  on  the  difference 
equation  approach  in  Section  2.3.1  were  used  to  calculate  the  removal  energies  of  constituent 
atoms  from  semiconductor  surfaces.  We  found  the  energies  from  a  fully  covered  surface  to  be 
substantially  different  from  those  from  a  nearly  empty  surface.  These  energies  also  vary  from 
surface  to  surface  and  do  not  exhibit  any  simple  relationship  with  the  bulk  cohesive  energies. 

The energetics and a simple thermodynamic growth model are shown to explain anomalies in the
MBE growth of HgCdTe. Similar approaches have also been used to calculate the cleavage
energies of semiconductors. The results compare well with available data. Several papers have
been published on this subject (Krishnamurthy et al. 1990a, Krishnamurthy et al. 1990b,
Krishnamurthy et al. 1990c).

We studied surface-induced ordering using the above-described procedure of surface and
LCAO GPM Green’s function. Although no ordering is expected on a (111)-oriented silicon
surface, we carried out the calculations for that surface as a test of accuracy. The interatom
interactions on the surface are strongly attractive so, in agreement with experiments, we predict
growth with the formation of islands (i.e., two-dimensional layer-by-layer growth). We carried
out similar calculations on the (111)B surface of Ga0.5Al0.5As. The excess pair energy is zero
within the accuracy of our calculations. However, the energy has to be less than zero (in our sign
convention) to obtain ordering and, indeed, experiments on the (111)B surface of this alloy do not
find ordering. Ordering has been observed on (100)- and (110)-oriented surfaces of Ga0.5Al0.5As
alloys. Once our Green’s function programs are extended to consider a four-layer unit plane,
instead of the bilayer unit plane, we will be able to study phenomena related to other surfaces.

2.3.4  Surface  Segregation  In  Alloys 

Pseudo-binary semiconductor alloys almost always have a surface concentration that is
different from the bulk in order to maintain a constant chemical potential for each layer in the
alloy. We have calculated the degree of surface segregation in these alloys, with numerical
examples presented for Hg1-xCdxTe and Hg1-xZnxTe. The enthalpy responsible for segregation
is the difference in the energies for moving an A or B atom from the bulk alloy to the surface.
There are two major contributions to this energy: (1) a bond-breaking mechanism,
whereby the element with the lowest surface energy segregates to the surface; and (2) strain
energy, where the dilute element in the compound is favored to segregate to the surface to
alleviate the strain due to mismatch of the AC and BC bond lengths. We use the quasi-chemical
approximation to calculate the free energy and hence the chemical potentials. Our results
indicate that there is strong surface enrichment of mercury in Hg1-xCdxTe, while there is less
surface segregation of mercury in Hg1-xZnxTe at the low x values appropriate for infrared
applications. Mercury segregation to the surface will lower the bandgap or may turn the surface
into a semimetal, thereby affecting the passivation of the surface. Several papers have been
published on this subject (see, for example, Patrick et al. 1989, Berding et al. 1990c).
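
The role of the segregation enthalpy can be illustrated with the simplest segregation isotherm. The sketch below uses the Langmuir-McLean (ideal-solution) form, not the quasi-chemical approximation used in this work, so it shows only the qualitative trend. The function name, the enthalpy values, and the temperature are illustrative assumptions, not results from this report.

```python
import math

K_B_EV = 8.617333e-5  # Boltzmann constant in eV/K


def surface_fraction(x_bulk: float, delta_h_ev: float, temperature_k: float) -> float:
    """Surface fraction of a species from the Langmuir-McLean isotherm.

    x_bulk        : bulk fraction of the species (strictly between 0 and 1)
    delta_h_ev    : enthalpy change (eV) on moving one atom of the species from
                    the bulk to the surface; negative values favor segregation
    temperature_k : absolute temperature in kelvin
    """
    if not 0.0 < x_bulk < 1.0:
        raise ValueError("x_bulk must lie strictly between 0 and 1")
    # x_s / (1 - x_s) = x_b / (1 - x_b) * exp(-dH / kT)
    ratio = x_bulk / (1.0 - x_bulk) * math.exp(-delta_h_ev / (K_B_EV * temperature_k))
    return ratio / (1.0 + ratio)


if __name__ == "__main__":
    # Hypothetical numbers: bulk Hg fraction 0.8, growth temperature 500 K.
    for delta_h in (-0.10, 0.0, 0.10):
        x_s = surface_fraction(0.8, delta_h, 500.0)
        print(f"dH = {delta_h:+.2f} eV -> surface fraction {x_s:.3f}")
```

With zero enthalpy the surface and bulk compositions coincide; a negative enthalpy enriches the surface in that species, which is the trend described above for mercury.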

2.4  ORDER-DISORDER  THEORY 

2.4.1  Formation Energies and Phase Stability

The tight-binding model discussed above was used to calculate the formation energies,
bond lengths, and bulk moduli of a number of III-V and II-VI semiconductor alloys. The input
parameters are those of the constituent compounds. The model is then applied to alloys without
further adjustment. We conclude that none of the ordered alloys found experimentally is in its
stable bulk state at the growth temperatures (Yeh et al. 1991). Although the alloy excess
energies can be negative if the reference constituent compounds are constrained to match the
substrate lattice used in the epitaxial growth, their magnitudes are not large enough to account
for the observed ordering. We present a possible explanation of the observed states in terms of a
barrier that prevents the metastable ordered alloy from decomposing into separate phases.
However, this explanation applies only to alloys with a large lattice mismatch between the
constituent systems. The next step is to see if surface ordering is taking place and, if it is, what
mechanism keeps the alloy from relaxing to the stable phases.

2.4.2  Critical  Temperature  of  Order-Disorder  Transitions 

There is a large literature on order-disorder transitions of alloys that ranges from
semiempirical to highly theoretical models. The semiempirical models include the Bragg-Williams
ideal solution models, Guggenheim’s quasi-chemical approximation (QCA), the Ising model with
empirical pair interactions, Stringfellow’s S-doping model, and Fedder and Muller’s
improvement of the S-doping concept, which removes some imprecision. All these methods miss
portions of the problem: they either treat the excess energy of clusters in the pseudobinary alloy
A1-xBx or A1-xBxC as being independent of concentration, or they make poor approximations to
the combinatorial analysis of the number of permitted arrangements of atoms in the alloy. No
theory has been published that relies on near-first-principles methods and proper combinatorics
and that makes the relationship between all these methods and the true physics clear (and, more
important, reaches proper conclusions). This material has been assembled into Chapter 4 of a
book on semiconductor alloys being written by A.-B. Chen and A. Sher. A draft copy of Chapter 4
is appended to this report (Appendix F); we emphasize that the chapter is still undergoing
revision.

2.4.3  Cluster Energies

We are recalculating the cluster energies that enter into the statistical mechanical
calculation of correlations and phase diagrams. We are using the new parameterized tight-binding
model discussed in Section 2.2.1. Previously, the 16-bond cluster energies were calculated from
the sum of strain and chemical energies (Sher et al., 1987; Sher et al., 1991, Appendix G). Strain
energies were calculated using Keating’s force constant model, while chemical energies were
calculated using Harrison’s metallization terms within the extended bond-orbital model. In the
present approach, we are using a cluster calculation similar to that used in computing the vacancy
formation energies (Berding et al., 1990b), in which the 16-bond cluster Hamiltonian is directly
diagonalized. In this cluster calculation, the positions of the twelve atoms at the edge of the
cluster are held fixed while the positions of the five alloy atoms in the cluster are permitted to
relax to the equilibrium position. Boundary conditions can be imposed by specifying the position
of the atoms at the cluster edges. We are including the nonisotropic boundary conditions
appropriate to the calculation of clusters on a lattice-constraining substrate. Cluster energies
will be used in the statistical mechanics to calculate correlations on epitaxial surfaces. Results
of this work will be published upon completion.


2.5  DISLOCATIONS  AND  ELECTRO-OPTIC  PROPERTIES 

It is observed that dislocations in semiconductors, in particular GaAs and HgCdTe, have
a significant impact on their electro-optic and transport properties. Both in GaAs at room
temperature and in Hg0.8Cd0.2Te at 77 K, it is found that dislocations begin to modify the
minority carrier lifetime, or equivalently the diffusion length, for dislocation densities in
excess of 10^5 cm^-2, i.e., average separations of ~30 µm. At higher densities, the lifetime
decreases quadratically with the dislocation density for low carrier concentrations. This
indicates that the dislocations are somehow combining to speed the recombination process. In
GaAs, as the carrier concentration increases from low values, the lifetime increases for a fixed
dislocation density of ~10^5 cm^-2; for high-enough concentrations, where Shockley-Read impurity
recombination becomes dominant, the lifetime decreases once more. Similar effects have been
observed in HgCdTe.
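
The quoted ~30 µm separation follows from the threshold density if the dislocations are treated as a roughly uniform array threading the surface, so that the mean spacing is the inverse square root of the areal density. A minimal sketch of that arithmetic (the uniform-array picture and the function name are assumptions made for illustration):

```python
import math


def mean_dislocation_spacing_um(density_per_cm2: float) -> float:
    """Mean spacing in micrometers of a uniform dislocation array.

    density_per_cm2 is the areal dislocation density in cm^-2.
    """
    if density_per_cm2 <= 0.0:
        raise ValueError("density must be positive")
    spacing_cm = 1.0 / math.sqrt(density_per_cm2)  # one dislocation per spacing^2
    return spacing_cm * 1.0e4  # 1 cm = 1e4 micrometers


if __name__ == "__main__":
    for density in (1.0e4, 1.0e5, 1.0e6):
        spacing = mean_dislocation_spacing_um(density)
        print(f"{density:.0e} cm^-2 -> {spacing:.1f} um")
```

For a density of 10^5 cm^-2 this gives a spacing of about 32 µm.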

In an attempt to understand the origin of these and other effects, we have calculated the
state densities in and near the bandgap of all four types of edge dislocations found in
zinc-blende-structured semiconductors. The four types are those that terminate at cores on anion
or cation lines, leaving either one (the “shuffle” set) or three (the “glide” set) dangling bonds
projecting into the core. Some of these dislocation types at 0 K exhibit partially filled states
in the gap, and others have some filled conduction band states or empty valence band states.
Thus, those dislocations with partially filled gap states will be surrounded by depletion layers
in both n- and p-type material, and the others will have either depletion or accumulation layers
around them, depending on the carrier type of the host material. The behavior of these four
different kinds of dislocations as they cross a p-n junction remains to be sorted out.

The dislocations with depletion layers serve as efficient recombination centers since their
fields attract minority carriers into the core, where recombination is fast. The effective capture
cross section is high because, in the simplest picture, its length scale is set by the depletion
layer thickness. However, even this large length (~1 µm in material with a low carrier
concentration) is still small compared to the ~30 µm separation between dislocations when the
dislocations begin to become effective lifetime killers.

The fields associated with a depletion layer, in this case around a line charge, do not fall
abruptly at its edge. For a line charge, the field outside the depletion layer is asymptotically
proportional to

$$\left(\frac{R_0}{r}\right)^{1/2} e^{-(r-R_0)/L_D},$$

where r is the radial coordinate centered on the core in cylindrical coordinates, R_0 is the
dislocation radius, and L_D is the Debye length. (For a point charge screened by free carriers,
the fields fall faster, as $(1/r)\,e^{-r/L_D}$.) If the distance a minority carrier drifts in
these depletion layer fields in the minority carrier lifetime is comparable to the separation
between dislocations, the dislocations become effective lifetime killers. We are still in the
process of making accurate calculations, but our preliminary estimates indicate that at low
carrier concentrations, where L_D is longest, the fields are sufficiently large in these
high-mobility materials to become effective. They will be more effective in p-type material,
where the minority species are the high-mobility electrons in the conduction band.

Given this mechanism, the initial increase of the lifetime with carrier concentration is a
natural consequence of the fact that the Debye length L_D is inversely proportional to the square
root of the carrier concentration.
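
The inverse-square-root scaling can be made concrete with the standard Debye-length expression L_D = sqrt(eps_r eps_0 k T / (q^2 n)). The sketch below is illustrative only: the relative permittivity of 12.9 (a typical value for GaAs), the temperature, and the carrier concentrations are assumptions, not values taken from this report.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m
K_B = 1.380649e-23       # Boltzmann constant, J/K
Q_E = 1.602176634e-19    # elementary charge, C


def debye_length_um(carrier_per_cm3: float,
                    temperature_k: float = 300.0,
                    eps_r: float = 12.9) -> float:
    """Debye length in micrometers for a free-carrier concentration in cm^-3.

    eps_r = 12.9 is an assumed relative permittivity (typical of GaAs).
    """
    if carrier_per_cm3 <= 0.0:
        raise ValueError("carrier concentration must be positive")
    n_per_m3 = carrier_per_cm3 * 1.0e6  # cm^-3 -> m^-3
    length_m = math.sqrt(eps_r * EPS0 * K_B * temperature_k / (Q_E ** 2 * n_per_m3))
    return length_m * 1.0e6  # m -> micrometers


if __name__ == "__main__":
    for n in (1.0e14, 1.0e15, 1.0e16):
        print(f"n = {n:.0e} cm^-3 -> L_D = {debye_length_um(n):.3f} um")
```

Each hundredfold increase in carrier concentration shortens the Debye length tenfold, so the range of the fields around each dislocation shrinks as the material is doped more heavily.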

We expect the nonlinear lifetime variation with dislocation density will also be a
consequence of the long ranges of the fields, becoming most important when the average separation
between dislocations approaches the order of the depletion layer radius. This study is continuing,
and must be completed before our speculations can be placed on a firm-enough footing to
warrant refereed publication.
3.  FIRST-PRINCIPLES APPROACH TO THE PLASTIC PROPERTIES OF HIGH-TEMPERATURE ALLOYS


3.1  INTRODUCTION 

With this end-of-year report we also attempt some final conclusions, since current funding
for this program has ceased, although we are at the entrance to potentially the most productive
phase of this work. In retrospect, our work over the past two years has led us into some isolated
fields such as the theoretical strength and the Jones theory of alloying. In parallel, however, we
have been developing the methods and understanding to tackle our original long-term goals. Of
Tasks 6 and 7 proposed in our extension of 16 March 1990, Task 7 is completed and Task 6 is as
good as completed. We have also made major inroads into the program as it was envisioned
developing into Years 3 and 4: we have made real progress toward our goal of alloy design from
first-principles quantum mechanics. We look forward to continuing this work, hopefully under
the aegis of AFOSR, in the future.

The topics in this section are of three categories.

(i)  The two items that complete Tasks 6 and 7.

(ii)  Our new work on the transition-metal trialuminides, which embodies a genuine
attempt at alloy design and looks towards the development of our work in future.

(iii)  “Jones’s theory of the Hume-Rothery phases revisited” is the title of a new paper
(a draft of which is included as Appendix H) written in collaboration with
M. Methfessel and D.G. Pettifor. This work, although somewhat outside the
purview of our original proposal, has a significant impact in theoretical alloy
design.

We report on these topics in the following four sections.

3.2  THE Ni-Al PHASE DIAGRAM

Task 7 was to compute the Ni-Al phase diagram using our total energy methods for
calculating internal energy differences and a suitable approximate scheme for the entropy. This
has been done in a collaboration between ourselves and a group at INPG, France. The work is to
appear shortly in Journal of Physics: Condensed Matter and a copy is included as Appendix I.

3.3  PLANAR FAULT ENERGIES IN L1_2 ALLOYS

Calculations of planar fault energies (antiphase boundary or APB, complex stacking fault
or CSF, and superlattice intrinsic stacking fault or SISF) are of central importance in theories
of deformation in the high-temperature intermetallics, as was emphasized in our original proposal
and is still evident, even though new theories of anomalous yield have appeared since then
(Dimiduk, 1991). Task 6 was to compute these energies in Ni3Al with particular attention paid
to the translation state of the defect. We say that this task is as good as completed because,
while we have the results now for Pt3Al, those for Cu3Au and Ni3Al have not been tested for
convergence with respect to supercell size. We can say, though, that in these two cases the errors
are approximately 15% and that the defect energies are overestimated by about this amount. So
far we have comprehensive data only on (111) defects; the study will not be complete until the
energies of (001) APBs are found, since it is the ratio of (001) to (111) APB energies that is of
particular interest. We regret that we are not able to continue since, as we show below, the
results are extremely encouraging. Because the work is incomplete and hence unpublished, we
give a detailed account here.

Our approach closely follows that of Yamaguchi et al. (1981), who used symmetry
arguments and classical pair potential simulations to deduce the shapes of energy vs. translation
curves for planar defects in L1_2 intermetallics. They could not say how particular metals would
behave but could only describe generic features of the problem. We can now supply the missing
link by doing explicit quantum mechanical calculations. We illustrate the procedure in Pt3Al.

We first inquire how effective the Harris-Foulkes approximation is, as is our usual
practice. We have calculated the shear constants C' and C44 and find good agreement with the
results of Fu and Yoo using the full-potential LAPW method (Fu and Yoo, 1989). We then compare
the L1_2-D0_19 and L1_2-D0_22 energy differences, which we show in Table 1, both self-consistent
and in the Harris-Foulkes approximation. For later reference, we show also our results for Cu3Au.

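Table 1

STRUCTURAL ENERGY DIFFERENCES IN Pt3Al AND Cu3Au
(mRy/atom)

             L1_2-D0_19          L1_2-D0_22
             H-F      S-C        H-F      S-C
Pt3Al        6.6      6.5        11.1     10.5
Cu3Au        1.9, 1.7 (column assignment not recoverable from the source)

H-F: Harris-Foulkes approximation; S-C: self-consistent.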

Convinced that the Harris-Foulkes approximation is appropriate in Pt3Al, we construct
supercells containing APBs parallel to (111) planes and separated by 3, 6, and 9 layers of perfect
crystal. We then calculate the APB energy in each cell to monitor the convergence with cell size.
The results are shown in Table 2.

Even when the defects are separated by only three layers of crystal, the APB energy is
well converged. We then use the three-layer supercells to calculate the energy of the SISF,
which does not relax in plane (Yamaguchi et al., 1981), and the energy and translation state of
the APB and CSF. All these defects are parallel to (111). Finally, we use a six-layer supercell to
calculate the energy of the unrelaxed APB parallel to (001).

Table 2

APB ENERGY IN Pt3Al AS A FUNCTION OF SUPERCELL SIZE

No. of layers      γ_APB (J/m^2)
3                  0.57
6                  0.54
9                  0.53

Our calculations in Ni3Al and Cu3Au are self-consistent using three-layer supercells. So
far, we have not fully checked the convergence with respect to supercell size in Cu3Au and
Ni3Al. A six-layer self-consistent calculation of the unrelaxed APB in Ni3Al gives an energy of
0.28 J/m^2 compared with 0.33 J/m^2 in the three-layer self-consistent calculation. Hence, we
estimate that the defect energies are exaggerated by about 15% in Ni3Al.
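
The 15% figure can be reproduced directly from the two Ni3Al values quoted above. In the sketch below the overestimate is measured relative to the three-layer value, which is an assumption about how the estimate was formed; measured relative to the six-layer value it would be closer to 18%. The function name is illustrative.

```python
def fractional_overestimate(small_cell_energy: float, large_cell_energy: float) -> float:
    """Fraction by which a small-supercell fault energy exceeds the value from a
    larger supercell, expressed relative to the small-supercell value."""
    if small_cell_energy <= 0.0:
        raise ValueError("small_cell_energy must be positive")
    return (small_cell_energy - large_cell_energy) / small_cell_energy


if __name__ == "__main__":
    # Unrelaxed (111) APB energies in J/m^2 quoted in the text and in Table 2.
    ni3al = fractional_overestimate(0.33, 0.28)  # three-layer vs six-layer
    pt3al = fractional_overestimate(0.57, 0.53)  # three-layer vs nine-layer
    print(f"Ni3Al: {100.0 * ni3al:.1f}%")
    print(f"Pt3Al: {100.0 * pt3al:.1f}%")
```

The same measure applied to the Pt3Al entries of Table 2 gives about 7%, consistent with the statement that the three-layer result there is already well converged.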

We now show the results that we have obtained so far. Each planar defect is characterized
(Yamaguchi et al., 1981) by the fault plane, (111) or (001), the fault vector p, and the
translation vector T. Thus, the net displacement across the fault is p + T. Table 3 shows the
characters of each fault studied; as shown, the translation state is characterized by the value
of t. The CSF, when t = -1, reverts to a perfect crystal (Yamaguchi et al., 1981). As t -> 1, the
APB becomes an SISF. In either case, t = 0 refers to the unrelaxed defect.

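Table 3

CHARACTERS OF PLANAR FAULTS IN THE L1_2 LATTICE

Fault     Fault plane     p               T
SISF      (111)           [illegible]     0
CSF       (111)           [illegible]     ~[112]
APB       (111)           ~[011]          ~[?11]
APB       (001)           ~[110]          ~[001]

Entries marked ~ are only partly legible in the scanned source: the index digits survive, but
the prefactors (including the factor t in T) and any overbars are lost.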

In Table 4, we show planar fault energies, with relaxed values and translation states in
those cases calculated so far. The only (001) APB energy so far found is for Pt3Al, and this is an
unrelaxed value.

In Figure 3, we show plots of fault energy against translation state, reminding the reader
that the results for Ni3Al and Cu3Au are estimated to be exaggerated by something around 15%.
More work is needed here using larger supercells.


Figure 3. (111) planar fault energies in three L1_2 alloys as a function of in-plane translation
(horizontal axes in units of a/6|<112>|). These local density calculations may be compared with
similar curves obtained earlier using pair potentials (Yamaguchi et al., 1981). The near
instability of the CSF in Pt3Al can be seen in the inset, which shows the curve on an expanded
scale.


Table 4

PLANAR FAULT ENERGIES γ (J/m^2) AND TRANSLATION STATES IN L1_2 INTERMETALLICS

                               (111)                            (001)
           SISF          CSF                 APB                 APB
           γ          γ        t          γ        t          γ        t
Pt3Al      0.56       0.60     -0.181     0.50     0.112      0.54     --
Ni3Al                 0.30     -0.117     0.28     0.104
Cu3Au      0.12       0.26     -0.034     0.16     0.028

We are deferring full discussion of these results until the study is completed. This is
certainly the first time that such a detailed series of calculations has been attempted, and we
have no doubt that the results, when completed, will represent a significant step forward in our
understanding of the deformation of L1_2 intermetallics. We advance some brief conclusions now,
however, from the most obvious points to emerge from Table 4 and Figure 3.

•  In Pt3Al, which is ordered to the melting point, the energies of the SISF and (001)
APB, in which Al atoms are not forced into nearest neighbor positions, are comparable
with those of the CSF and (111) APB, in which there are Al-Al neighbors. This is
surprising and contrary to the usual assumptions (Flinn, 1960). Results (indicated
in Table 4) show that the SISF energy is close to zero in Ni3Al, so that these two
strongly ordering systems differ in this respect (it may be significant that Ni3Al
shows anomalous yielding while Pt3Al does not).

•  The CSF in Pt3Al is practically unstable. γ_CSF has a minimum at t = -0.181 and a
maximum at t = -0.352, the energy barrier being 0.015 J/m^2, corresponding to
0.24 kT per atom at the interface at room temperature. Thus, partial dislocations
separated by CSF are not likely to be observed in Pt3Al. In contrast, Ni3Al and
Cu3Au, which both show anomalous yielding, have very stable CSFs.

•  As expected from the weak ordering in Cu3Au, the APB and CSF energies on (111)
are lower than in Pt3Al, and the translation states are smaller.

•  As has been emphasized by Vitek (1990) but ignored in previous density functional
calculations (Fu and Yoo, 1989), allowing the defects to relax can lead to significant
changes in the calculated fault energy. In the extreme case of the CSF in
Pt3Al, relaxation causes more than 10% lowering. Now that we can predict the
translation state, it becomes important to look for it experimentally in defect fringes
and in variations in the Burgers vector of dissociated dislocations.

•  Finally, even allowing for overestimation of 15%, the calculated APB energy in
Ni3Al is significantly higher than any of the measured values, which are themselves
rather scattered (Dimiduk, 1991). We need to investigate this discrepancy in future
work. Maybe there is an error in the local density approximation, or maybe the
theory and experiment can still be rationalized. We are not surprised, however, to
find disagreement between our results and a previous local density calculation (Fu
and Yoo, 1989), since those authors used only very small supercells and calculated
only two energies (γ_SISF and γ_CSF + γ_APB). As we see from our results, their claim
that γ_CSF = γ_APB + γ_SISF is greatly in error. Indeed, the equivalence is not even true
in a pair potential description (Yamaguchi et al., 1981).
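
The conversion of the CSF energy barrier from J/m^2 to kT per interface atom, quoted in the second point above, can be checked with a short calculation. The sketch assumes an ideal fcc-based (111) plane, for which the area per atom is sqrt(3)/4 times the square of the cubic lattice constant, and a lattice constant of about 3.88 Å for Pt3Al; that lattice constant and the choice of 293 K for room temperature are assumptions, not values given in this report.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def area_per_atom_111_m2(lattice_const_angstrom: float) -> float:
    """Area per atom (m^2) in a (111) plane of an fcc-based lattice.

    The plane is a triangular net with nearest-neighbor distance a/sqrt(2),
    which gives an area of sqrt(3)/4 * a^2 per atom.
    """
    a_m = lattice_const_angstrom * 1.0e-10
    return math.sqrt(3.0) / 4.0 * a_m ** 2


def barrier_per_atom_in_kT(barrier_j_per_m2: float,
                           lattice_const_angstrom: float,
                           temperature_k: float) -> float:
    """Energy barrier per interface atom, in units of kT."""
    energy_j = barrier_j_per_m2 * area_per_atom_111_m2(lattice_const_angstrom)
    return energy_j / (K_B * temperature_k)


if __name__ == "__main__":
    # 0.015 J/m^2 is the barrier quoted in the text; 3.88 angstrom and 293 K are assumed.
    value = barrier_per_atom_in_kT(0.015, 3.88, 293.0)
    print(f"barrier = {value:.2f} kT per interface atom")
```

Under these assumptions the barrier comes out close to the 0.24 kT per atom quoted above, that is, well below the thermal energy at room temperature.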


3.4  QUANTUM  MECHANICAL  ALLOY  DESIGN  AND  THE  TRANSITION-METAL  TRIALUMINIDES 

This work, undertaken in collaboration with D.G. Pettifor, will appear in the February
1992 issue of Scripta Metallurgica and a preprint is appended to this report (Appendix J). The
paper indicates the direction in which our work is progressing toward the long-term goals
outlined in our original proposal. A major aim is to use our first-principles theory to predict
alloy compositions and stimulate experimental programs. Our reasons for choosing Al3Ru as a
likely candidate for a ductile transition-metal trialuminide are given in our paper. We have
persuaded C.T. Liu of ORNL, Tennessee, to prepare some alloys and we look forward to the results
of his investigations. So far, we understand, his results are promising. We have predicted the
crystal structure of Al3Ru to be D0_22 and not the structure reported previously: in fact, as we
point out, almost nothing is known of the equilibrium properties of Al3Ru. We hope to work with
the INPG group to predict the Al-Ru phase diagram, which is also known only tentatively.


3.5  THE  JONES  THEORY  OF  ALLOYING 

Jones’s early theory (1937) of the β-phase alloy of Cu-Zn has had a turbulent history, and
most textbooks are misleading, not to say wrong, on this subject. In collaboration with
M. Methfessel and D.G. Pettifor, we have used modern electron theory and an advanced method
of Brillouin zone integration to redo Jones’s calculation using the rigid-band approximation with
and without the inclusion of d electrons. From the band structure alone, we can now understand
the origin of the face-centered cubic structure of the α-phase, and the stabilization of β- and
ε-brass, as Cu is alloyed with Zn. It turns out that Jones’s approach was valid but, although his
conclusions were in perfect agreement with the facts, the agreement was entirely coincidental
and due to the approximations he made. Without explicit inclusion of d electrons (which he, of
course, neglected) one cannot predict all the phases of brass.

The  writing-up  of  this  work  has  taken  some  time,  and  led  to  a  rather  lengthy  manuscript. 
A  current  draft  is  appended  herewith. 
Full-potential Korringa-Kohn-Rostoker band theory applied to the Mathieu potential

The band theory of Korringa, Kohn, and Rostoker (KKR) based on the Green-function method
is extended to space-filling potentials. A numerical test using the Mathieu potential shows good
convergence for the bands up to 1.5 Ry with l ≤ 4 included in the angular-momentum expansion for
the wave functions. Our results strongly support the applicability of the full-potential KKR to
bulk electronic-structure problems.


I.  INTRODUCTION


The Korringa, Kohn, and Rostoker (KKR) band theory is an elegant theory for the
one-electron energy bands in a close-packed crystal for which the muffin-tin (MT) construction
for the potential is a reasonable approximation. To expand the scope of application, considerable
effort has been expended to extend the KKR theory to full crystal potentials. One concern about
such extension is related to the so-called near-field corrections (NFC) (Refs. 3-5) arising from
the expansion of the KKR Green function beyond the muffin-tin region. Although there are proofs
showing that NFC do not exist, questions have been raised about the applicability of the theory.
Since space-filling potentials are nonspherical and the Wigner-Seitz cell boundary is not smooth,
we are further concerned about the speed of convergence in terms of angular-momentum (l)
expansions. In this paper the integral equation approach of Kohn and Rostoker (KR) is used to
derive the full-potential KKR (FP-KKR) equation explicitly. One advantage of our derivation is
that all the quantities involved are functions of r within a unit cell. Thus we can avoid the
uncertainty in extending the wave function beyond the unit cell encountered in some other
derivations. We have also tested the convergence by comparing the numerical results with the
exact solution for the Mathieu potential in the simple-cubic crystal. Excellent results for the
band structure in the energy range of interest are obtained with a maximum value of l = 4
included in this expansion.

The fact that the Mathieu potential is exactly soluble gives it an advantage for testing
purposes over working with realistic potentials. Our test complements the empty-lattice
potential to provide a stringent test for the FP-KKR theory. The strong angular-momentum
dependence in the Mathieu potential gives a good representation of the anisotropy that is present
in the open structures pertaining to many semiconductors and insulators. The restriction of the
KKR to close-packed metals imposed by the muffin-tin approximation is lifted by the
full-potential method discussed here. The results obtained here should encourage the application
of this theory to real crystals.


II.  THE  FULL-POTENTIAL  KKR  EQUATION 


In this section we show that the Kohn-Rostoker integral equation can be simply extended to
obtain the full-potential KKR theory. The Schrödinger equation in the band calculation,

$$[-\nabla^2 + V(\mathbf{r})]\,\psi_{\mathbf{k}}(\mathbf{r}) = E\,\psi_{\mathbf{k}}(\mathbf{r})$$  (1)

for a full crystal potential $V(\mathbf{r})$, is equivalent to the following integral equation:

$$\psi_{\mathbf{k}}(E,\mathbf{r}) = \int_{\tau} G_{\mathbf{k}}(E;\mathbf{r},\mathbf{r}')\,V(\mathbf{r}')\,\psi_{\mathbf{k}}(E,\mathbf{r}')\,d\mathbf{r}' ,$$  (2)

where the integration is over the Wigner-Seitz cell of volume $\tau$, and $\mathbf{k}$ is a crystal
wave vector. $G_{\mathbf{k}}(E;\mathbf{r},\mathbf{r}')$ in Eq. (2) is the KKR free-electron Green's function


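$$G_{\mathbf{k}}(E;\mathbf{r},\mathbf{r}') = -\frac{1}{\tau}\sum_{n}\frac{\exp[i(\mathbf{K}_n+\mathbf{k})\cdot(\mathbf{r}-\mathbf{r}')]}{(\mathbf{K}_n+\mathbf{k})^2 - E} ,$$  (3)

where $\mathbf{K}_n$ are the reciprocal-lattice vectors. Alternatively,
$G_{\mathbf{k}}(E;\mathbf{r},\mathbf{r}')$ can be expressed as

$$G_{\mathbf{k}}(E;\mathbf{r},\mathbf{r}') = -\frac{1}{4\pi}\sum_{s}\frac{\exp(i\kappa|\mathbf{r}-\mathbf{r}'-\mathbf{R}_s|)}{|\mathbf{r}-\mathbf{r}'-\mathbf{R}_s|}\,\exp(i\mathbf{k}\cdot\mathbf{R}_s) ,$$  (4)

where $\kappa=\sqrt{E}$ for $E>0$ and $\kappa=i\sqrt{-E}$ for $E<0$, and $\mathbf{R}_s$ are the lattice
translation vectors. To derive the FP-KKR equation, we first observe that Eq. (2) can be
cast into a surface integral,

$$\int_{S_\tau}\left[G_{\mathbf{k}}(E;\mathbf{r},\mathbf{r}')\,\nabla'\psi_{\mathbf{k}}(E,\mathbf{r}') - \psi_{\mathbf{k}}(E,\mathbf{r}')\,\nabla' G_{\mathbf{k}}(E;\mathbf{r},\mathbf{r}')\right]\cdot d\mathbf{S}' = 0 ,$$  (5)

where $S_\tau$ is the surface of the Wigner-Seitz cell.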

Since $|\mathbf{r}'|$ in the surface integral exceeds the muffin-tin radius $r_m$, we need to
consider the expansion of the Green function beyond the original range of Kohn and
Rostoker. Several authors have already considered this point. For simplicity, we shall
only consider the case with one atom per unit cell. Since this expansion is a central
point of controversy, we rederive the results explicitly in the Appendix for the range of
$\mathbf{r}$ and $\mathbf{r}'$ needed here. We show that the expansion


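$$G_{\mathbf{k}}(E;\mathbf{r},\mathbf{r}') = \sum_{L}\sum_{L'}\left[\,i^{\,l-l'}B_{LL'}(\mathbf{k},E)\,J_L(\kappa\mathbf{r})\,J_{L'}(\kappa\mathbf{r}') + \kappa\,\delta_{LL'}\,J_L(\kappa\mathbf{r})\,N_{L'}(\kappa\mathbf{r}')\right]$$  (6)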


is valid as long as both $\mathbf{r}$ and $\mathbf{r}'$ are inside $\tau$ and satisfy the following
condition:

$$|\mathbf{r}| < |\mathbf{r}'| < |\mathbf{R}_s| \quad \text{for all } \mathbf{R}_s \neq 0 .$$  (7)

In Eq. (6) the notations $J_L(\kappa\mathbf{r}) = j_l(\kappa r)Y_L(\hat{\mathbf{r}})$ and
$N_L(\kappa\mathbf{r}) = n_l(\kappa r)Y_L(\hat{\mathbf{r}})$ are used, where $j_l$ and $n_l$ are,
respectively, the spherical Bessel and Neumann functions, $Y_L$ is a real spherical
harmonic, and $L$ represents the double indices $(l,m)$. $B_{LL'}(\mathbf{k},E)$ is the usual KKR
structure constant. We note that for any $|\mathbf{r}|$ smaller than $r_m$, the condition in
Eq. (7) is satisfied for all $\mathbf{r}'$ contributing to the surface integration in Eq. (5).
The condition $|\mathbf{r}'| < |\mathbf{R}_s|$ in Eq. (7) holds for most lattices; exceptions are
those, for example, with long narrow cells. For such cases, this condition can be
satisfied by breaking the unit cell into smaller cells, including so-called “empty cells”
which do not contain an atomic nucleus.

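The wave function inside the cell $\tau$ can be expanded in a basis set
$\{\Phi_L(E,\mathbf{r})\}$ as

$$\psi_{\mathbf{k}}(E,\mathbf{r}) = \sum_{L} a_L(\mathbf{k},E)\,\Phi_L(E,\mathbf{r}) .$$  (8)

The basis function $\Phi_L(E,\mathbf{r})$ is a regular solution to the Schrödinger equation
inside $\tau$,

$$[-\nabla^2 + V(\mathbf{r})]\,\Phi_L(E,\mathbf{r}) = E\,\Phi_L(E,\mathbf{r}) ,$$  (9)

and behaves like $J_L(\kappa\mathbf{r})$ at the origin $r \to r_0 = 0$, which is typically the
location of the atomic nucleus. This basis set can be calculated using the following
integral equation:

$$\Phi_L(E,\mathbf{r}) = J_L(\kappa\mathbf{r}) + \sum_{L'}\int_{r'<r} g_{L'}(E;\mathbf{r},\mathbf{r}')\,V(\mathbf{r}')\,\Phi_L(E,\mathbf{r}')\,d\mathbf{r}' ,$$  (10)

where $g_L(E;\mathbf{r},\mathbf{r}')$ is a free-particle Green's function defined as

$$g_L(E;\mathbf{r},\mathbf{r}') = \kappa\left[J_L(\kappa\mathbf{r})\,N_L(\kappa\mathbf{r}') - N_L(\kappa\mathbf{r})\,J_L(\kappa\mathbf{r}')\right] .$$  (11)

We note that the basis function $\Phi_L$ is coupled to other angular-momentum channels for
$r > r_0$, because the crystal potential $V(\mathbf{r})$ is not spherical.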

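The expansions of $G$ in Eq. (6) and of $\psi$ in Eq. (8) can be substituted in Eq. (5) to
obtain

$$\sum_{L} J_L(\kappa\mathbf{r})\sum_{L''}\left[\sum_{L'} i^{\,l-l'}B_{LL'}(\mathbf{k},E)\,S_{L'L''}(E) + \kappa\,C_{LL''}(E)\right]a_{L''}(\mathbf{k},E) = 0 , \quad r < r_m ,$$  (12)

where

$$S_{L'L''}(E) = \kappa\int_{S_\tau}\left[J_{L'}(\kappa\mathbf{r}'),\,\Phi_{L''}(E,\mathbf{r}')\right]\cdot d\mathbf{S}'$$  (13)

and

$$C_{LL''}(E) = \kappa\int_{S_\tau}\left[N_{L}(\kappa\mathbf{r}'),\,\Phi_{L''}(E,\mathbf{r}')\right]\cdot d\mathbf{S}' .$$  (14)

In the above, the notation $[f,g] = f\,\nabla g - g\,\nabla f$ has been used. The surface
integrals in Eqs. (13) and (14) are over the boundaries of $\tau$, as indicated by $S_\tau$.
Since the $J_L(\kappa\mathbf{r})$ in Eq. (12) are linearly independent functions, the following
set of homogeneous equations holds:

$$\sum_{L''}\left[\sum_{L'} i^{\,l-l'}B_{LL'}(\mathbf{k},E)\,S_{L'L''}(E) + \kappa\,C_{LL''}(E)\right]a_{L''}(\mathbf{k},E) = 0 .$$  (15)

This is the FP-KKR equation that we are after.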
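A homogeneous set of equations such as Eq. (15) has nontrivial solutions only at energies where the determinant of its coefficient matrix vanishes. The sketch below illustrates one simple way to locate such energies, by scanning for sign changes of the determinant and refining each by bisection. It is an illustration only: `toy_matrix` is a hypothetical 2x2 stand-in, not the actual matrix built from $B$, $S$, and $C$, and all function names are assumptions. In a real KKR calculation the determinant also has poles, so a plain sign-change scan would need extra care.

```python
def determinant(matrix):
    """Determinant of a small square matrix by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]
    n = len(a)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if a[pivot][col] == 0.0:
            return 0.0
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det
        det *= a[col][col]
        for row in range(col + 1, n):
            factor = a[row][col] / a[col][col]
            for j in range(col, n):
                a[row][j] -= factor * a[col][j]
    return det


def secular_roots(build_matrix, e_min, e_max, steps=400, tol=1e-10):
    """Energies in [e_min, e_max] where det M(E) changes sign, refined by bisection."""
    roots = []
    de = (e_max - e_min) / steps
    e_lo, d_lo = e_min, determinant(build_matrix(e_min))
    for i in range(1, steps + 1):
        e_hi = e_min + i * de
        d_hi = determinant(build_matrix(e_hi))
        if d_lo * d_hi < 0.0:
            a, b, d_a = e_lo, e_hi, d_lo
            while b - a > tol:
                mid = 0.5 * (a + b)
                d_mid = determinant(build_matrix(mid))
                if d_a * d_mid <= 0.0:
                    b = mid
                else:
                    a, d_a = mid, d_mid
            roots.append(0.5 * (a + b))
        e_lo, d_lo = e_hi, d_hi
    return roots


def toy_matrix(energy):
    # Hypothetical 2x2 stand-in for the coefficient matrix of Eq. (15).
    return [[energy - 1.0, 0.2], [0.2, energy - 2.0]]


if __name__ == "__main__":
    print(secular_roots(toy_matrix, 0.0, 3.0))
```

For the toy matrix the two roots are the solutions of $(E-1)(E-2) - 0.04 = 0$.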


We note that our derivation is similar to Nesbet’s
derivation. We hope, however, that the above explicit
derivation  may  be  more  accessible  to  some  readers.  It  is 
also  useful  for  establishing  the  notation  necessary  for  the 
description  of  the  application  of  FP-KKR  theory  to  the 
solution  of  the  Mathieu  potential  which  constitutes  the 
main  result  of  this  paper. 

III. CALCULATION OF S AND C MATRICES

The surface integrals for the S and C matrices in Eqs. (13) and (14) can be very time
consuming. It is desirable to seek simplification of these calculations in a real
application. One plausible approximation which is consistent with the KKR spirit is to
expand every quantity involved in angular-momentum components. Equations (13) and (14) are
equivalent to the volume integrations

$$S_{LL'}(E) = \kappa\int_{\tau} J_L(\kappa\mathbf{r})\,V(\mathbf{r})\,\Phi_{L'}(E,\mathbf{r})\,d\mathbf{r}$$  (16)

and

$$C_{LL'}(E) = -\delta_{LL'} + \kappa\int_{\tau} N_L(\kappa\mathbf{r})\,V(\mathbf{r})\,\Phi_{L'}(E,\mathbf{r})\,d\mathbf{r} .$$  (17)

One can free the limits of these integrals by replacing the crystal potential $V$ by the
truncated potential

$$V^{t}(\mathbf{r}) = V(\mathbf{r})\,\sigma(\mathbf{r}) ,$$  (18)

where $\sigma(\mathbf{r})$ is a step function defined as

$$\sigma(\mathbf{r}) = \begin{cases} 1, & \text{for } \mathbf{r} \text{ within } \tau \\ 0, & \text{otherwise.} \end{cases}$$  (19)

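The angular-momentum expansion for the basis function is assumed to be

$$\Phi_L(E,\mathbf{r}) = \sum_{L'}\phi_{L'L}(E,r)\,Y_{L'}(\hat{\mathbf{r}}) ,$$  (20)

and the truncated potential $V^{t}$ is expanded as

$$V^{t}(\mathbf{r}) = \sum_{L} V^{t}_{L}(r)\,Y_L(\hat{\mathbf{r}}) .$$  (21)

The integrations in Eqs. (16) and (17) can be reduced, respectively, to the simple radial
integrations

$$S_{LL'}(E) = \kappa\sum_{L''}\int_{0}^{r_c} j_l(\kappa r)\,V^{t}_{LL''}(r)\,\phi_{L''L'}(E,r)\,r^2\,dr$$  (22)

and

$$C_{LL'}(E) = -\delta_{LL'} + \kappa\sum_{L''}\int_{0}^{r_c} n_l(\kappa r)\,V^{t}_{LL''}(r)\,\phi_{L''L'}(E,r)\,r^2\,dr ,$$  (23)

where $r_c$ is the radius of the circumscribing sphere of the Wigner-Seitz cell. The matrix
$V^{t}_{LL'}(r)$ is given by

$$V^{t}_{LL'}(r) = \int Y_L(\hat{\mathbf{r}})\,V^{t}(\mathbf{r})\,Y_{L'}(\hat{\mathbf{r}})\,d\Omega = \sum_{L''} C_{LL'L''}\,V^{t}_{L''}(r) ,$$  (24)

where

$$C_{LL'L''} = \int Y_L(\hat{\mathbf{r}})\,Y_{L'}(\hat{\mathbf{r}})\,Y_{L''}(\hat{\mathbf{r}})\,d\Omega$$  (25)

is a Gaunt coefficient.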

Note that in the above the basis function $\Phi_L(E,\mathbf{r})$ is assumed to be calculated
from Eq. (10), where $V(\mathbf{r})$ is the full crystal potential. This is the same procedure
used by Brown and Ciftan (BC). The original Williams-Morgan (WM) approach, however, used
the truncated potential $V^{t}$ for the calculation of the basis function in Eq. (10). If
the expansions of the potential and of $\Phi_L$ in Eq. (10) include all the
angular-momentum components, both approaches will probably give the same results for the
band structure, provided both converge. In practice, the expansion is limited to a certain
$l_{max}$; therefore, these two approaches yield different results.

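In the actual calculation of the basis functions using either $V$ or $V^{t}$, we first
write the potential as the sum of $V_0$ and $\Delta V$, where $V_0$ is the spherical part
of the potential and $\Delta V$ is the rest. We then solve for the radial wave function
$f_l$ corresponding to $V_0$. Similarly, the basis is written as
$\Phi_L = F_L + \Delta\Phi_L$, where $F_L = f_l Y_L$, and $\Delta\Phi_L$ is calculated from
the integral equation

$$\Delta\Phi_L(E;\mathbf{r}) = \sum_{L'}\int_{r'<r} g_{L'}(E;\mathbf{r},\mathbf{r}')\,\Delta V(\mathbf{r}')\,F_L(E;\mathbf{r}')\,d\mathbf{r}' + \sum_{L'}\int_{r'<r} g_{L'}(E;\mathbf{r},\mathbf{r}')\,V(\mathbf{r}')\,\Delta\Phi_L(E;\mathbf{r}')\,d\mathbf{r}'$$  (26)

iteratively using angular-momentum expansions for all quantities involved.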
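The iterative solution of a Volterra-type integral equation can be illustrated with a much simpler, one-channel analogue. The sketch below is an assumption-laden toy, not the authors' procedure: it treats only the s-wave radial function $u(r) = r\,\phi(r)$ in a spherical potential, uses the standard s-wave Volterra kernel $\sin[\kappa(r-r')]/\kappa$, requires $E > 0$, and all names and grid parameters are hypothetical. Each pass substitutes the previous iterate into the integral, as in a Born series.

```python
import math


def solve_volterra_s_wave(energy, potential, r_max=3.0, n=200, iterations=30):
    """Iterate u(r) = sin(kr)/k + int_0^r sin(k(r - r'))/k * V(r') * u(r') dr' on a uniform grid."""
    k = math.sqrt(energy)                      # assumes energy > 0 (Rydberg-like units)
    h = r_max / n
    r = [i * h for i in range(n + 1)]
    free = [math.sin(k * x) / k for x in r]
    v = [potential(x) for x in r]
    u = free[:]                                # zeroth iterate: the free solution
    for _ in range(iterations):
        source = [v[j] * u[j] for j in range(n + 1)]
        new_u = []
        for i in range(n + 1):
            # Trapezoid rule; both end points contribute zero (u(0) = 0, kernel(r, r) = 0).
            integral = h * sum(
                math.sin(k * (r[i] - r[j])) / k * source[j] for j in range(1, i)
            )
            new_u.append(free[i] + integral)
        u = new_u
    return r, u


if __name__ == "__main__":
    # Constant potential V0 < E: the exact regular solution is sin(k' r)/k', k'^2 = E - V0.
    grid, u_num = solve_volterra_s_wave(1.0, lambda x: 0.3)
    k_prime = math.sqrt(0.7)
    print(max(abs(a - math.sin(k_prime * x) / k_prime) for x, a in zip(grid, u_num)))
```

A constant potential is used in the usage example only because its exact solution is known, which makes the convergence of the iteration easy to check.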

IV. APPLICATION TO MATHIEU POTENTIAL

To test the accuracy of the FP-KKR equation and the convergence in angular expansion
described above, we applied the theory to the Mathieu potential of the form

$$V(\mathbf{r}) = -U_0\left[\cos\frac{2\pi x}{a} + \cos\frac{2\pi y}{a} + \cos\frac{2\pi z}{a}\right] ,$$  (27)

where we took the lattice constant $a$ to be $2\pi$ times the Bohr radius and the potential
parameter $U_0$ to be 0.5 Ry. Because the potential is separable, the eigenvalue problem
reduces to three one-dimensional problems. The band structures and corresponding wave
functions can be computed to the precision of the computer and can be regarded as “exact”
in the numerical comparison.
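The separability argument can be made concrete with a short sketch. Assuming Rydberg units with $a = 2\pi$ Bohr radii (so the reciprocal-lattice vectors are integers), each one-dimensional problem $-u'' - U_0\cos(x)\,u = \epsilon u$ becomes a symmetric tridiagonal matrix in a plane-wave basis, and a three-dimensional level is a sum of three one-dimensional levels. The code below is an illustrative reconstruction, not the authors' program; the function names, the basis cutoff `n_max`, and the bisection eigenvalue solver are all choices made here.

```python
def count_below(diag, off, x):
    """Sturm count: number of eigenvalues of a symmetric tridiagonal matrix below x."""
    count = 0
    d = 1.0
    for i in range(len(diag)):
        d = (diag[i] - x) - (off[i - 1] ** 2 / d if i > 0 else 0.0)
        if d == 0.0:
            d = 1e-300
        if d < 0.0:
            count += 1
    return count


def mathieu_1d_levels(k, u0=0.5, n_max=12, n_levels=3):
    """Lowest 1D levels (Ry) at wave vector k (units of 2*pi/a) for V(x) = -u0*cos(x)."""
    diag = [(k + n) ** 2 for n in range(-n_max, n_max + 1)]
    off = [-0.5 * u0] * (len(diag) - 1)        # <n|V|n+1> = -u0/2
    lo, hi = min(diag) - u0, max(diag) + u0    # Gershgorin bounds
    levels = []
    for j in range(n_levels):
        a, b = lo, hi
        for _ in range(100):
            mid = 0.5 * (a + b)
            if count_below(diag, off, mid) > j:
                b = mid
            else:
                a = mid
        levels.append(0.5 * (a + b))
    return levels


def lowest_band(kx, ky, kz, u0=0.5):
    """Lowest 3D band energy (Ry): sum of the lowest 1D level along each axis."""
    return sum(mathieu_1d_levels(k, u0)[0] for k in (kx, ky, kz))


if __name__ == "__main__":
    for name, kpt in (("Gamma", (0, 0, 0)), ("X", (0.5, 0, 0)),
                      ("M", (0.5, 0.5, 0)), ("R", (0.5, 0.5, 0.5))):
        print(name, round(lowest_band(*kpt), 4))
```

Higher bands follow by combining excited one-dimensional levels in the same way.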

FIG. 1. Angular-momentum expansion of the Mathieu potential along [100], [110], and [111].
The solid circles represent the continuous crystal potential $V(\mathbf{r})$, and the solid
lines are the truncated potential $V^{t}(\mathbf{r})$. The dotted and the dashed lines are the
sums of the angular-momentum components up to $l = 8$ for $V(\mathbf{r})$ and
$V^{t}(\mathbf{r})$, respectively. $r_m$, $r_e$, and $r_c$ are the distances between the origin
and the face, edge, and corner of the cube, respectively. Notice that the dotted line and
the solid circles are not distinguishable in the figure.

The Mathieu potential is poorly represented by the MT approximation, because the simple
cubic structure is rather open and the potential has a large variation in the interstitial
region. For example, with $U_0$ set to 0.5 Ry, the MT constant potential is
$V_c = U_0[9/\pi(6-\pi)] = 0.501116$ Ry, while the actual value of the potential
varies from -0.5 Ry at $(\tfrac{1}{2},0,0)a$ to 1.5 Ry at $(\tfrac{1}{2},\tfrac{1}{2},\tfrac{1}{2})a$.
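The muffin-tin constant quoted above can be checked with a few lines of code. The sketch below assumes the constant is the average of the Mathieu potential of Eq. (27) over the part of the cube outside the inscribed sphere, with $a = 2\pi$; it evaluates the closed form $9U_0/[\pi(6-\pi)]$ and cross-checks it with a coarse midpoint-grid average. The function names and the grid size are illustrative choices.

```python
import math


def muffin_tin_constant(u0):
    """Closed-form interstitial average of the Mathieu potential: u0 * 9 / (pi * (6 - pi))."""
    return u0 * 9.0 / (math.pi * (6.0 - math.pi))


def interstitial_average(u0, n=40):
    """Midpoint-grid average of V over the cube outside the inscribed sphere (a = 2*pi)."""
    a = 2.0 * math.pi
    h = a / n
    pts = [-0.5 * a + (i + 0.5) * h for i in range(n)]
    cos_vals = [math.cos(p) for p in pts]
    radius2 = (0.5 * a) ** 2
    total, count = 0.0, 0
    for i, x in enumerate(pts):
        for j, y in enumerate(pts):
            for k, z in enumerate(pts):
                if x * x + y * y + z * z > radius2:
                    total += -u0 * (cos_vals[i] + cos_vals[j] + cos_vals[k])
                    count += 1
    return total / count


if __name__ == "__main__":
    print(muffin_tin_constant(0.5), interstitial_average(0.5))
```

The grid average is only a rough check, since the sphere boundary is resolved to within one grid spacing.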

When the full potential in Eq. (27) is expanded in cubic harmonics,
$V(\mathbf{r}) = \sum_L V_L(r)K_L(\hat{\mathbf{r}})$, $V_L(r)$ is proportional to
$-U_0\,j_l(2\pi r/a)$, and the series converges very fast. With an $l_{max} = 8$ one can
achieve a converged $V(\mathbf{r})$, as shown in Fig. 1. However, in the expansion for the
truncated potential, $V^{t}(\mathbf{r}) = V(\mathbf{r})\sigma(\mathbf{r}) = \sum_L V^{t}_L(r)K_L(\hat{\mathbf{r}})$,
the components

$$V^{t}_L(r) = \int K_L(\hat{\mathbf{r}})\,V(\mathbf{r})\,\sigma(\mathbf{r})\,d\Omega$$  (28)

have to be evaluated numerically with great care. Because of the sharp edges and corners
in $V^{t}(\mathbf{r})$, the angular-momentum expansion converges only slowly. This is evident
in Fig. 1, which shows sizable errors in all three directions [100], [110], and [111] in
the expansion of $V^{t}$ up to $l_{max} = 8$.
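The statement that the components of the full potential are proportional to spherical Bessel functions can be checked for the lowest component. Averaging Eq. (27) over all directions at fixed radius should give $-3U_0\,j_0(2\pi r/a)$, since the spherical average of $\cos(qx)$ is $j_0(qr)$. The sketch below compares a direct angular quadrature with this closed form; the midpoint quadrature, the grid sizes, and the function names are choices made here for illustration.

```python
import math


def spherical_average(u0, r, a=2.0 * math.pi, n_theta=120, n_phi=60):
    """Average of the Mathieu potential over the sphere of radius r (midpoint quadrature)."""
    q = 2.0 * math.pi / a
    total = 0.0
    for i in range(n_theta):
        cos_t = -1.0 + (i + 0.5) * 2.0 / n_theta    # uniform in cos(theta): equal solid angle
        sin_t = math.sqrt(1.0 - cos_t * cos_t)
        for j in range(n_phi):
            phi = (j + 0.5) * 2.0 * math.pi / n_phi
            x = r * sin_t * math.cos(phi)
            y = r * sin_t * math.sin(phi)
            z = r * cos_t
            total += -u0 * (math.cos(q * x) + math.cos(q * y) + math.cos(q * z))
    return total / (n_theta * n_phi)


def analytic_l0(u0, r, a=2.0 * math.pi):
    """Closed form of the spherical average: -3 * u0 * j0(2*pi*r/a)."""
    x = 2.0 * math.pi * r / a
    return -3.0 * u0 * (math.sin(x) / x if x != 0.0 else 1.0)


if __name__ == "__main__":
    for radius in (0.5, 1.5, 3.0):
        print(radius, spherical_average(0.5, radius), analytic_l0(0.5, radius))
```

The truncated potential has no such closed form, which is why its components in Eq. (28) need careful numerical integration.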

We have carried out the FP-KKR calculation using the wave-function expansion in Eqs. (10)
and (24) up to $l_{max} = 4$. The basis sets are calculated using both the BC and WM
approaches with the potentials expanded up to $l_{max} = 8$. Results from the MT-KKR
approximation are also obtained for comparison.

In Fig. 2(a), the solid lines represent the “exact” band structures for the Mathieu
potential. The dots are the MT-KKR results. Despite the crude approximation in the MT
potential, the lowest band is still reasonable. The MT approximation becomes worse at the
higher energies, as indicated by the large energy deviations and splittings of the levels.
For example, the “exact” bands from Γ to X and from Γ to M around 0.9 Ry are degenerate
due to the separability of the Mathieu potential, while the MT approximation lifts this
“accidental” degeneracy.

FIG. 2. Comparison of (a) the muffin-tin KKR and (b) the FP-KKR band structures (the dots)
with the exact results (the solid lines) for the Mathieu potential. The symmetry points Γ,
X, M, and R correspond to the wave vectors (0,0,0), (1/2,0,0), (1/2,1/2,0), and
(1/2,1/2,1/2), respectively, in units of 2π/a.

The full-potential KKR results are compared with the “exact” band structure in Fig. 2(b).
The dots are now the FP-KKR results, calculated with the BC approach. The agreement is
excellent and rather uniform up to 1.3 Ry. The calculation even preserves the accidental
degeneracy at Γ at energy 0.88 Ry. The lowest band has a detectable deviation of 0.016 Ry
at R, but has a very small root-mean-square (rms) deviation. The deviations at R and some
other energy states are probably due to the truncation in the angular-momentum expansions.
The FP-KKR bands based on the WM approach are not noticeably different from those based on
the BC approach plotted in Fig. 2(b). However, there are slight differences between the
results of the two approaches. For reference, we list the deviations of both the BC and WM
approaches and the “exact” energies in Table I. While the WM approach gives a larger
deviation in the lowest-energy band around R, the overall rms deviations of these two
approaches are similarly small. These results imply some freedom in the choice of basis
functions. Provided that reasonable approximations are made in the representation of the
cell potential and in the calculation of $S_{LL'}$ and $C_{LL'}$ from Eqs. (22) and (23),
it appears that the FP-KKR equation will give reasonable bands independent of the exact
algorithm for obtaining the $\Phi_L$, e.g., from $V(\mathbf{r})$, $V^{t}(\mathbf{r})$, or other
smooth potentials augmented to


TABLE I. Deviations of the FP-KKR band energies ΔE from the exact values E_exact for the
Mathieu potential at several symmetry points. The subscripts WM and BC stand, respectively,
for the Williams-Morgan and Brown-Ciftan approaches described in the text. All energies are
in Ry.

Symmetry state    ΔE_WM      ΔE_BC      E_exact
Γ                -0.0053    -0.0034    -0.3414
Γ                 0.0243     0.0223     0.7517
Γ                -0.0015     0.0024     0.8653
Γ                 0.0034     0.0069     0.8653
R                 0.0274     0.0160    -0.0827
R                 0.0089     0.0064     0.4097
R                 0.0156     0.0145     0.9020
R                 0.0184     0.0171     1.3943
X                 0.0024     0.0005    -0.2551
X                 0.0214     0.0216     0.2372
X                 0.0060     0.0056     0.8379
X                -0.0010    -0.0021     0.9515
X                -0.0047    -0.0036     0.9515
M                 0.0016     0.0002    -0.1689
M                 0.0072     0.0067     0.3234
M                 0.0270     0.0259     0.8158
M                 0.0033     0.0037     0.9241
M                 0.0058    -0.0005     1.0377

V. SUMMARY

The main purpose of this paper is to test the accuracy of the FP-KKR theory in
band-structure calculations. To help eliminate doubts about this theory, we have derived
the FP-KKR equation explicitly from the Kohn-Rostoker integral equation. This FP-KKR theory
still preserves the clear separation between the structural and potential information
possessed by the MT-KKR equation. The potential information is contained in the S and C
matrices, which can be easily calculated if the basis functions and potentials are
expressed in angular-momentum expansions. Such expansions are desirable in a realistic
calculation. The whole procedure has been tested against the exactly soluble Mathieu
potential in the simple-cubic structure. Because of the openness of the structure and the
high anisotropy of the potential, this potential provides a challenging model against which
to test any band-structure theory. Our results show that with wave functions expanded up to
$l_{max} = 4$ and the potential up to $l_{max} = 8$, the FP-KKR theory as described above
gives excellent results for the bands in the energy range needed for solid-state
applications. With this method, one should be able to deal with solids having open
structures, such as semiconductors, for which MT-KKR is not suitable.

APPENDIX: GREEN-FUNCTION EXPANSION

Here we want to show that Eq. (6) is valid when Eq. (7) is satisfied. Following Kohn and
Rostoker, we separate $G_{\mathbf{k}}$ of Eq. (5) into two parts,

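$$G_{\mathbf{k}}(E;\mathbf{r},\mathbf{r}') = g_0(\mathbf{k},E;\mathbf{r},\mathbf{r}') + g_1(\mathbf{k},E;\mathbf{r},\mathbf{r}') ,$$  (A1)

where $g_0$ is the singular part,

$$g_0(\mathbf{k},E;\mathbf{r},\mathbf{r}') = -\frac{1}{4\pi}\,\frac{\exp(i\kappa|\mathbf{r}-\mathbf{r}'|)}{|\mathbf{r}-\mathbf{r}'|} ,$$  (A2)

and

$$g_1(\mathbf{k},E;\mathbf{r},\mathbf{r}') = -\frac{1}{4\pi}\sum_{s\neq 0}\frac{\exp(i\kappa|\mathbf{r}-\mathbf{r}'-\mathbf{R}_s|)}{|\mathbf{r}-\mathbf{r}'-\mathbf{R}_s|}\,\exp(i\mathbf{k}\cdot\mathbf{R}_s) .$$  (A3)

For $r < r' < R_s$ and for $\mathbf{r}$ and $\mathbf{r}'$ inside $\tau$, the first part has the
expansion $g_0 = -i\kappa\sum_L J_L(\kappa\mathbf{r})H_L(\kappa\mathbf{r}')$, where
$H_L(\kappa\mathbf{r}) = J_L(\kappa\mathbf{r}) + iN_L(\kappa\mathbf{r})$. Under the same condition for
$\mathbf{r}$ and $\mathbf{r}'$, $|\mathbf{r}| < |\mathbf{r}' + \mathbf{R}_s|$ also holds for a Wigner-Seitz
cell, so that the following expansion is valid: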


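$$-\frac{1}{4\pi}\,\frac{\exp(i\kappa|\mathbf{r}-\mathbf{r}'-\mathbf{R}_s|)}{|\mathbf{r}-\mathbf{r}'-\mathbf{R}_s|} = -i\kappa\sum_{L} J_L(\kappa\mathbf{r})\,H_L(\kappa(\mathbf{r}'+\mathbf{R}_s)) = -i\kappa\sum_{L}\sum_{L'} J_L(\kappa\mathbf{r})\,H_{LL'}(\kappa\mathbf{R}_s)\,J_{L'}(\kappa\mathbf{r}') ,$$  (A4)

where

$$H_{LL'}(\kappa\mathbf{R}_s) = 4\pi\sum_{L''} i^{\,l'+l''-l}\,C_{LL'L''}\,H_{L''}(\kappa\mathbf{R}_s) ,$$  (A5)

where $C_{LL'L''}$ is given in Eq. (25). Therefore the Green function has the expansion in
Eq. (6) with the structure constant given by

$$B_{LL'}(\mathbf{k},E) = -i\kappa\left[\delta_{LL'} + i^{\,l'-l}\sum_{s\neq 0} H_{LL'}(\kappa\mathbf{R}_s)\,\exp(i\mathbf{k}\cdot\mathbf{R}_s)\right] .$$  (A6)

ABSTRACT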

Defects are a well-known source of problems in low-x Hg$_{1-x}$Cd$_x$Te and
Hg$_{1-x}$Zn$_x$Te. To gain insight into the relative importance of various native point
defects in these materials we have calculated the formation energy of native point defects
in HgTe using a full-potential linearized muffin-tin orbital method. Breathing-mode
relaxations were included. Formation entropies resulting from the change to the phonon
spectrum upon formation of a defect were calculated using a valence force field model for
the elastic contributions and a rigid ion model for the Coulomb contributions to the
dynamical matrix. The energies and entropies were incorporated into mass-action equations
to deduce the relative defect concentration. In agreement with experiments, we find that
mercury vacancies are the dominant defect to accommodate excess tellurium in the lattice
when the material is equilibrated in the presence of mercury vapor; we predict that
tellurium antisites are dominant in material equilibrated with tellurium solid. Of the
defects that accommodate excess mercury, we find that mercury antisites may be more
prevalent than previously thought. Extensions to the alloy are also discussed.


PACS numbers: 68.35.Md, 68.55.Bd
I. INTRODUCTION


Defects are a well-known source of problems in HgCdTe, and a number of questions concerning
them remain unanswered. For one, although post-growth anneals of as-grown p-type
vacancy-doped material are routine, the observed conversion to n-type has never been
satisfactorily explained, although a residual donor is believed to be the culprit. Also, a
number of studies on the diffusion of the constituents in HgCdTe and HgZnTe have been done,
but the source of the observed double-diffusion profiles is still not understood. These and
other problems could be better addressed if there were a more complete analysis of the
point defects in this material. Moreover, a comparative analysis of the defects in HgCdTe
and HgZnTe would facilitate an assessment of the relative merits of these materials.

With the goal of a better understanding of the point defects in HgCdTe and HgZnTe, we
examine the properties of the cation and anion vacancies, antisites, and interstitial atoms
in HgTe. In a previous paper [1] we presented results using the linearized muffin-tin
orbital (LMTO) approach within the atomic spheres approximation (ASA) to calculate the
defect formation energies. In this paper, we eliminate the ASA and use a more accurate
full-potential (FP-LMTO) approach and include a breathing-mode relaxation about the defect
site. Additionally, we calculate the defect formation entropy resulting from the change to
the phonon spectrum using a Green’s function method with a valence force field and rigid
ion model. We conclude this paper by incorporating our energies and entropies into a
mass-action model for the defect concentrations.


II.  DEFECT  FORMATION  ENERGIES 

The full-potential linearized muffin-tin orbital method (FP-LMTO) is used to calculate the
defect formation energies [2]. These calculations omit the atomic spheres approximation
employed in our previous work [1], in which the charge density inside a sphere about each
atomic site is assumed spherically symmetric. The FP-LMTO calculations also permit us to
calculate the lattice relaxation about the defect site. Defect formation energies are
calculated from a difference in total energies of the compound with and without the defect,
where the free atom state is used as the reference state, unless otherwise specified.
Supercells are used in which a periodic array of defects is constructed. Because we wish to
calculate the formation energies in the dilute limit, we use the largest supercell that is
computationally feasible. Details of this work, including energy convergence with cluster
size, will be presented elsewhere [3].

Two forms of atomic relaxation are used in these calculations. The first is a radial
breathing-mode relaxation of the neighbors nearest to the defect site. Second, an overall
relaxation of the lattice constant is included to eliminate the pressure on the supercell.
This second relaxation will approach zero as the supercell size increases and the dilute
limit is approached. We are currently looking at non-breathing-mode relaxations, such as
the trigonal and tetragonal distortions which split the degeneracy of the triply degenerate
T2 states [3]. Because the symmetry of the distortion depends on the ionization state of
the defect, distortions and charge states must be treated simultaneously.

The localized states associated with defects are among the most important and interesting
of defect properties, but these properties are difficult to calculate using modern
first-principles theories. The local density approximation (LDA), on which the LMTO is
based, is inadequate to predict the semiconductor band gaps; higher-order corrections to
the many-body equations must be included to correct the band-gap problem. Consequently, in
the present calculations we have not attempted to identify the location of the localized
states in the gap.

The present calculations are performed using the Harris-Foulkes approximation, in which
atomic charge densities are superposed and the resulting charge density and potentials are
used to solve the LDA equations. This approximation has been found to be in good agreement
with self-consistent results for bulk semiconductor and metal properties [2].
Self-consistent calculations were also done to test the validity of the Harris-Foulkes
prescription for defect total energies. Agreement to within 0.2 eV was found for all
defects studied.

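We calculate the defect formation energies for the following reactions:

$$E_{V_{\mathrm{Hg}}} + \mathrm{HgTe} \rightarrow V_{\mathrm{Hg}}\mathrm{Te} + \mathrm{Hg}_g$$  (1)

$$E_{V_{\mathrm{Te}}} + \mathrm{HgTe} \rightarrow \mathrm{Hg}V_{\mathrm{Te}} + \mathrm{Te}_g$$  (2)

$$E_{\mathrm{Hg_{Te}}} + \mathrm{Hg}_g + \mathrm{HgTe} \rightarrow \mathrm{Hg}\,\mathrm{Hg_{Te}} + \mathrm{Te}_g$$  (3)

$$E_{\mathrm{Te_{Hg}}} + \mathrm{Te}_g + \mathrm{HgTe} \rightarrow \mathrm{Te_{Hg}}\mathrm{Te} + \mathrm{Hg}_g .$$  (4)

The reference states are chosen so that the number of unit cells on both sides of the
reactions is equal and excess atoms are accommodated in their free atomic state. Other
reference states of interest can be obtained by considering the additional defect
reactions:

$$E_{\mathrm{HgTe}} + \mathrm{HgTe} \leftrightarrow \mathrm{Te}_g + \mathrm{Hg}_g ,$$  (5)

$$E_{\mathrm{Te_2}} + \tfrac{1}{2}(\mathrm{Te_2})_g \leftrightarrow \mathrm{Te}_g$$  (6)

and

$$E_{\mathrm{Te}_s} + \mathrm{Te}_s \leftrightarrow \mathrm{Te}_g .$$  (7)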

In Eqs. (1)-(7), A$_B$ corresponds to an A atom or defect on a B site, where V is a
vacancy, g the free atom state, and s the elemental solid. Our results for the energies of
Eqs. (1)-(5) are given in Table I. The predicted cohesive energy of 4.1 eV is larger than
the experimental value of 3.3 eV; the difference is due to underlying errors in the LDA
formalism. We are currently considering a number of approaches that would allow us to
correct the numbers in Table I for this LDA error. Calculations were done for 8-, 16-, and
32-atom supercells; for all defects, convergence is found to within 0.5 eV. We are
currently examining larger supercell sizes to better understand the convergence to the
dilute limit, which we wish to imitate; results will be presented elsewhere [3].

Nearest-neighbor radial relaxations are also given in Table I. The near-neighbor
relaxations are found to be nearly independent of supercell size. Both the Hg and Te
antisites are too big to fit into the lattice and result in an outward relaxation of their
nearest neighbors. In contrast, the Hg and Te vacancies result in an inward relaxation of
the nearest neighbors. These results are in contrast to results for the silicon vacancy
[4], where an outward relaxation was found.

III.  DEFECT  FORMATION  ENTROPIES 

Defect formation entropies are calculated from the change to the phonon spectrum resulting
from the introduction of the defect. A valence force-field model within the rigid hybrid
approximation is used to calculate the elastic contributions to the dynamical matrix in
the harmonic approximation, and experimental elastic constants are used. For II-VI
semiconductors, we also include a Coulomb contribution to the dynamical matrix, which
results from the ionic nature of the bonding. Unlike the elastic contributions to the
dynamical matrix, the Coulomb contributions are long-range in nature and induce a
macroscopic electromagnetic field, which results in a screening of the transverse optical
phonons. The ionic charge is chosen to yield agreement with experiment for the zone-center
splitting of the transverse and longitudinal optical phonons. The phonon dispersion curves
calculated from the dynamical matrix are in fair agreement with the experimental
dispersion curves. The discrepancies with the experimental curves are largely due to the
absence of long-range elastic interactions in the near-neighbor valence force-field model
employed.

A Brillouin zone integration is done to calculate the phonon density-of-states for the
ideal crystal without a defect. From the density-of-states matrix, ρ, the Green’s function
can be calculated from


G0(a,2)  =  /““  +  ^(u.^)ln(-“'-“f»-!’')  (8) 

Min  - 11/  > 


where the singularity in the integral has been explicitly removed. The change in the total
density-of-states when a defect is introduced into the crystal can be deduced from Dyson’s
equation to obtain


$$\Delta\rho(\omega^2) = \frac{1}{\pi}\,\mathrm{Im}\,\frac{d}{d\omega^2}\,\ln\det\left[1 - G^{0}(\omega^2)\,V\right]$$  (9)

where V is the perturbation potential. In the present case, we use a strictly site-diagonal
perturbation potential for the mass change due to the introduction of an isolated defect.
The resulting entropy change is given by

where k is Boltzmann’s constant and ħ is Planck’s constant divided by 2π. Calculated
entropy changes for the various defect reactions in Eqs. (1)-(5) are given in Table I.
Further details of the calculations, including defects with both on- and off-diagonal
disorder, are given elsewhere [5].

IV.  DEFECT  EQUILIBRIUM 

In the following analysis we consider only native point defects in bulk HgTe. Although
surfaces will play an important role in determining the properties of HgCdTe, in particular
for the epitaxial films, we will not discuss those effects here. Furthermore, we consider
only the neutral defects in the analysis, although the ionized states are certainly
important for many defects, and will affect the defect concentrations. Finally, full
equilibrium is assumed, i.e., the material is assumed to have been annealed for a
sufficiently long time.

For the analysis of the defects, we will find it convenient to reference all of the defects
to pure HgTe and Hg in the vapor state. The above reactions become:

$$E'_{V_{\mathrm{Hg}}} + \mathrm{HgTe} \leftrightarrow V_{\mathrm{Hg}}\mathrm{Te} + \mathrm{Hg}_g$$  (11)

$$E'_{V_{\mathrm{Te}}} + \mathrm{Hg}_g \leftrightarrow \mathrm{Hg}V_{\mathrm{Te}}$$  (12)

$$E'_{\mathrm{Hg_{Te}}} + 2\mathrm{Hg}_g \leftrightarrow \mathrm{Hg}\,\mathrm{Hg_{Te}}$$  (13)

$$E'_{\mathrm{Te_{Hg}}} + 2\mathrm{HgTe} \leftrightarrow \mathrm{Te_{Hg}}\mathrm{Te} + 2\mathrm{Hg}_g$$  (14)

In addition to these reactions, we also include those corresponding to the interstitial
mercury and tellurium. These are:

$$E'_{\mathrm{Hg}_I} + \mathrm{Hg}_g \leftrightarrow \mathrm{Hg}_I$$  (15)

and

$$E'_{\mathrm{Te}_I} + \mathrm{HgTe} \leftrightarrow \mathrm{Te}_I + \mathrm{Hg}_g$$  (16)

The subscript I refers to the interstitial site. Entropies and energies for these two
defect reactions are given in Table I. We have not calculated the change to the phonon
spectrum from adding an interstitial atom to the solid; we approximate it by equating the
entropy of an interstitial atom to that of an atom on its ideal lattice site. The energies
for the reactions in Eqs. (15) and (16) have been taken from our previous ASA calculations
[6]. Because of the large lattice distortions induced by the interstitial atoms, these
calculations are more difficult to perform than for the other defects. Results for
interstitial mercury and tellurium using the FP-LMTO will be presented elsewhere [3]. Other
defects, such as the Schottky and Frenkel types, can be obtained by taking sums and
differences of the above reactions, Eqs. (11)-(16).

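The reaction constants corresponding to the above reactions are:

$$K_{V_{\mathrm{Hg}}} = (kT)^{5/2}(2\pi m_{\mathrm{Hg}})^{3/2}h^{-3}\exp\!\left(\frac{S_{V_{\mathrm{Hg}}}}{k}\right)\exp\!\left(-\frac{E'_{V_{\mathrm{Hg}}}}{kT}\right) = [V_{\mathrm{Hg}}]\,P_{\mathrm{Hg}}$$  (17)

$$K_{V_{\mathrm{Te}}} = (kT)^{-5/2}(2\pi m_{\mathrm{Hg}})^{-3/2}h^{3}\exp\!\left(\frac{S_{V_{\mathrm{Te}}}}{k}\right)\exp\!\left(-\frac{E'_{V_{\mathrm{Te}}}}{kT}\right) = [V_{\mathrm{Te}}]\,P_{\mathrm{Hg}}^{-1}$$  (18)

$$K_{\mathrm{Hg_{Te}}} = (kT)^{-5}(2\pi m_{\mathrm{Hg}})^{-3}h^{6}\exp\!\left(\frac{S_{\mathrm{Hg_{Te}}}}{k}\right)\exp\!\left(-\frac{E'_{\mathrm{Hg_{Te}}}}{kT}\right) = [\mathrm{Hg_{Te}}]\,P_{\mathrm{Hg}}^{-2}$$  (19)

$$K_{\mathrm{Te_{Hg}}} = (kT)^{5}(2\pi m_{\mathrm{Hg}})^{3}h^{-6}\exp\!\left(\frac{S_{\mathrm{Te_{Hg}}}}{k}\right)\exp\!\left(-\frac{E'_{\mathrm{Te_{Hg}}}}{kT}\right) = [\mathrm{Te_{Hg}}]\,P_{\mathrm{Hg}}^{2}$$  (20)

$$K_{\mathrm{Hg}_I} = (kT)^{-5/2}(2\pi m_{\mathrm{Hg}})^{-3/2}h^{3}\exp\!\left(\frac{S_{\mathrm{Hg}_I}}{k}\right)\exp\!\left(-\frac{E'_{\mathrm{Hg}_I}}{kT}\right) = [\mathrm{Hg}_I]\,P_{\mathrm{Hg}}^{-1}$$  (21)

and

$$K_{\mathrm{Te}_I} = (kT)^{5/2}(2\pi m_{\mathrm{Hg}})^{3/2}h^{-3}\exp\!\left(\frac{S_{\mathrm{Te}_I}}{k}\right)\exp\!\left(-\frac{E'_{\mathrm{Te}_I}}{kT}\right) = [\mathrm{Te}_I]\,P_{\mathrm{Hg}} .$$  (22)

Here T is the temperature in Kelvin, $P_{\mathrm{Hg}}$ is the mercury pressure in atmospheres,
h is Planck’s constant, and $m_{\mathrm{Hg}}$ is the mass of mercury. The square brackets
indicate the defect site fraction.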
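To give a feel for the magnitudes in these mass-action relations, the sketch below evaluates the ideal-gas prefactor $(kT)^{5/2}(2\pi m_{\mathrm{Hg}})^{3/2}h^{-3}$ in atmospheres and the resulting vacancy site fraction, assuming the relation for the mercury vacancy has the form $K = \text{prefactor}\times\exp(S/k)\exp(-E/kT) = [V_{\mathrm{Hg}}]P_{\mathrm{Hg}}$. The function names are illustrative, and the energy and entropy passed in the usage example are placeholders, not values from Table I.

```python
import math

K_B = 1.380649e-23        # Boltzmann constant, J/K
H_PLANCK = 6.62607015e-34 # Planck constant, J s
AMU = 1.66053907e-27      # atomic mass unit, kg
EV = 1.602176634e-19      # joules per eV
ATM = 101325.0            # pascals per atmosphere
M_HG = 200.59 * AMU       # mass of a mercury atom, kg


def vapor_prefactor_atm(temperature):
    """(kT)^(5/2) * (2*pi*m_Hg)^(3/2) / h^3, converted from Pa to atm."""
    kt = K_B * temperature
    return kt ** 2.5 * (2.0 * math.pi * M_HG) ** 1.5 / H_PLANCK ** 3 / ATM


def vacancy_site_fraction(temperature, p_hg_atm, energy_ev, entropy_kb):
    """[V_Hg] = K / P_Hg for a reaction that releases one Hg atom into the vapor."""
    boltzmann = math.exp(-energy_ev * EV / (K_B * temperature))
    k_const = vapor_prefactor_atm(temperature) * math.exp(entropy_kb) * boltzmann
    return k_const / p_hg_atm


if __name__ == "__main__":
    t_kelvin = 250.0 + 273.15
    print(vapor_prefactor_atm(t_kelvin))
    # Placeholder energy (eV) and entropy (units of k), for illustration only.
    print(vacancy_site_fraction(t_kelvin, 1.0, 2.0, 5.0))
```

Because the prefactor is of order $10^{8}$ atm at 250 °C, the site fraction is very sensitive to the formation energy in the exponent.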

We will first consider the defects that accommodate excess tellurium: $V_{\mathrm{Hg}}$,
Te$_{\mathrm{Hg}}$, and Te$_I$. Evaluating the above mass-action relations for the solid in
equilibrium at T = 250 °C
and  Pfig^l 


(23) 

H--” 

(24) 

We can see from these equations that the mercury vacancy is the dominant native point
defect to accommodate excess tellurium and that the tellurium antisite and interstitial
tellurium densities are negligible in comparison for the given anneal conditions. Even at
higher temperatures, near 500 °C, the vacancy is the dominant defect and exceeds the
tellurium  antisite  concentrations  by  ~  10^ 

Next we consider the equilibrium of the material with elemental solid tellurium. This will
be of interest if the material contains tellurium precipitates and the region near the
precipitate is in equilibrium with this solid tellurium phase. Rewriting the mass-action
equations  to  reference  all  defects  to  the  telltirium  solid,  we  find  at  T=250*C  and  PHg=^ 
atm  that 

>^  =  10^  (25) 

and 


(26) 


The result in Eq. (25) indicates that the concentration of tellurium antisites will exceed
that  of  mercury  vacancies  in  the  vicinity  of  the  tellurium  precipitate.  This  may  have 
interesting  consequences  for  the  properties  of  partially  annealed  material  when  tellurium 
precipitates  are  present.  Perhaps  a  more  relevant  situation  to  consider  is  material  which 
has  been  equilibrated  at  higher  temperatures,  which  is  then  subsequently  annealed  to 
remove the Hg vacancies. For equilibration at 500°C we find

$$\frac{[Te_{Hg}]}{[V_{Hg}]} = 1000. \qquad (27)$$

Even at this elevated temperature the tellurium antisite density is expected to exceed the
mercury vacancy density near tellurium inclusions. We would expect the tellurium
antisites to be less mobile than mercury vacancies and thus more difficult to anneal.
Therefore,  if  the  material  is  slow  to  anneal  at  the  lower  annealing  temperatures  because 
of  low  mobility  of  the  tellurium  antisite,  an  atmosphere  of  a  significant  antisite  density 
will  be  present  about  the  tellurium  inclusions.  The  size  of  these  atmospheres  will  depend 
on  the  dynamics  of  the  growth  process,  since  this  is  a  nonequilibrium  situation.  If  such 
atmospheres  of  tellurium  antisites  are  present,  we  can  expect  that  they  will  result  in  some 
interesting properties. For example, if the tellurium antisites are donors, regions of
n-type material may be present in the otherwise p-type, vacancy-doped material at high
temperatures. If the tellurium antisites are acceptors, regions of excessive p-type carrier
concentration will exist in the p-type, vacancy-doped material. If the tellurium antisites
were slow to anneal at low temperatures, subjecting the material to a low-temperature
anneal could leave regions of relatively high n- or p-type character.

Next, we consider the defects that accommodate excess mercury in the lattice. Using
the Hg vapor as the reference state, for material equilibrated at T = 250°C and $P_{Hg}$ = 1
atm we find


and 


t^gTel  10^7 
[VTel  ■ 


(28) 


(29) 


We thus conclude that the mercury antisite is the dominant defect that accommodates
excess  mercury  in  the  lattice.  Consequently  we  find  that 


W-“' 


(30) 
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Diffusion  measurements  in  HgCdTe  indicate  that  interstitial  mercury  is  the  dominant 
diffusing  species  at  high  mercury  pressures[7].  Even  though  we  predict  them  to  be  a 
majority  species,  the  mercury  antisites  may  not  contribute  to  diffusion  because  they  are 
expected  to  be  far  less  mobile  than  interstitial  mercury  and  their  diffusion  will  involve 
tellurium  vacancies  or  interstitial  tellurium,  both  of  which  are  energetically  costly  to 
create.  Additionally,  although  the  mercury  antisites  are  predicted  to  be  the  dominant 
equilibrium  species,  this  equilibrium  may  not  be  realized  because  the  equilibration  time 
for  antisites  may  be  substantially  longer  than  for  interstitial  atoms. 

The  above  results  are  for  the  compound  HgTe.  The  question  remains  as  to  the  exten¬ 
sion  of  these  results  to  the  alloys  HgZnTe  and  HgCdTe  for  the  mercury-rich  regimes.  In 
a  previous  paper  [8],  using  a  tight-binding  model  we  showed  that  the  vacancy  formation 
energies  are  sensitive  to  the  alloy  environment,  in  particular  for  vacancies  of  the  common 
constituent,  i.e.,  tellurium  vacancies  in  HgCdTe  and  HgZnTe.  Even  for  vacancies  of  the 
substituted  species,  for  which  the  alloy  environment  lies  in  the  second  neighbor  shell  and 
beyond,  we  found  the  vacancy  formation  energies  to  vary  in  going  from  one  environment 
to another, by several tenths of an electron volt. We previously found that the Hg
vacancy formation energy is smaller in $Hg_{1-x}Cd_xTe$ than in $Hg_{1-x}Zn_xTe$ for their respective
x corresponding to a band gap of 0.1 eV, indicating lower vacancy concentrations
in HgZnTe. For comparisons of the relative defect densities done above, one needs the
variation  of  the  antisite  and  interstitial  atom  energies,  which  are  also  expected  to  depend 
on  the  alloy  environment. 

Several  improvements  to  the  present  calculation  are  under  way  and  will  be  presented 
elsewhere  [3].  First,  we  still  must  confirm  the  convergence  of  our  calculations  by  going  to 
even  larger  supercells.  We  anticipate  that  the  shifts  to  the  energies  will  be  comparable 
for all the defects, and therefore conclusions based on ratios of defect densities will be
unchanged. Second, as discussed above, a correction for the LDA error would help in
improving the present results and allow us to more reliably benchmark our energies against
experimental  numbers.  Third,  we  have  included  only  the  breathing-mode  distortion  for 
the  defects.  Inclusion  of  Jahn-Teller  distortions  can  lower  the  formation  energy,  but  since 
they  are  charge-state  dependent,  we  have  not  done  so  in  the  present  work.  For  the  neutral 
defects,  we  do  not  expect  that  this  will  alter  the  present  conclusions.  A  combination  of 
the  Jahn-Teller  distortion  and  the  LDA  error  is  a  likely  explanation  of  our  discrepancy 
with  the  experimental  values  for  the  mercury  vacancy  of  2.24  eV  [9].  Finally,  the  defect 
equilibrium  must  be  done  for  the  appropriate  charge  state  of  the  system,  while  the  present 
analysis  was  done  for  the  neutral  defects  only. 
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Table  I.  Defect  formation  energies  for  the  defect  reactions  indicated  in  text. 

Energies shown in parentheses are from ASA calculations, published by Berding et al. [6]; an
average of the anion and cation tetrahedral interstitial sites has been used here. Defect entropy
changes include the phonon contribution only. Radial relaxations are the change in the distance
between the center of the defect site and its near neighbors.

          Formation energy (eV)   Near-neighbor radial   S (10^-5 eV/K)
Defect    E         E'            relaxation (Å)         250 C      500 C
V_Hg      2.6       2.6           -0.14                  -66.3      -76.4
V_Te      5.8       1.7           -0.17                  -60.4      -70.4
Hg_Te     3.4       -0.7          0.06                   5.5        5.5
Te_Hg     -0.1      4.0           0.20                   -5.8       -5.8
HgTe      4.1       ...           ...                    -135.5     -155.4
Hg_i      ...       (0.88)        ...                    -72        -82
Te_i      ...       (4.96)        ...                    -72        -82
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A  tight-binding  model  is  used  to  calculate  the  formation  energies,  bond  lengths,  and  bulk  moduli 
of  a  number  of  ordered  III-V  and  II-VI  semiconductor  alloys.  The  parameters  in  the  model  are  ad¬ 
justed  so  that  the  bond  lengths,  cohesive  energies,  bulk  moduli,  and  shear  elastic  constants  for  the 
constituent  compounds  are  described  correctly.  The  model  is  then  applied  to  alloys  without  further 
adjustment. Based on the calculated excess energies, we conclude that none of the ordered alloys
found experimentally is in its stable bulk equilibrium state at the growth temperatures. Although
the alloy excess energies can be negative, if the reference constituent compounds are constrained to
match the substrate lattice used in epitaxial growth, their magnitudes are not large enough to
account for the observed ordering. A possible explanation of the observed states in terms of a barrier
that prevents the metastable ordered alloy from decomposing into separate phases is presented.
However,  this  explanation  only  applies  to  alloys  with  lattice-mismatched  constituents.  Detailed  re¬ 
sults  on  the  bond  lengths  and  bulk  moduli  are  also  discussed. 


I. INTRODUCTION

The bulk $A_{1-x}B_xC$ semiconductor alloys were long thought to be ideal pseudobinary
compounds, in which the C atoms sit in a fcc sublattice while the substituting atoms A and B
randomly occupy the sites of the other fcc sublattice. However, several findings in the last
few years revealed quite a different picture. First, extended x-ray-absorption fine-structure
(EXAFS) experiments''^ clearly showed a bimodal distribution for the nearest-neighbor bond
lengths in these alloys, which implies that the equilibrium atomic positions are not the
virtual-crystal sites of the zinc-blende crystal. Recent experiments^'* and theories’’* also
indicated that the arrangement of the alloying atoms in these systems is not completely
random. Most surprising of all, however, has been the discovery of long-range ordering (LRO)
in these alloys grown epitaxially.'^'* Essentially, all the III-V alloys grown by
molecular-beam epitaxy (MBE) or metal organic chemical vapor deposition (MOCVD) under some
special growth conditions are found to be ordered. While a great majority of these ordered
alloys form the $ABC_2$ compounds in one or more of the following three crystal structures,
the CuPt (CP), CuAu I (CA), and chalcopyrite (CH), a few alloys are ordered in the form of
$A_3BC_4$ with the famatinite or luzonite structure. These crystal structures have been well
described by Wei et al.^ Table I is a partial list of the LRO alloys that have been grown,
together with the growth conditions and ordered structure found. For later emphasis, we note
that the substrate temperatures for the ordering to occur range from 400°C to 800°C, and the
ordering directions are not necessarily the same as the growth direction.

Finding LRO is surprising, because it is at variance with the well-established conventional
picture for the bulk semiconductor alloys having simple phase diagrams with miscibility
gaps*'”** driven by strain energy. The question is, are these ordered alloys thermodynamic
equilibrium states? This question can be answered if accurate values for the alloy excess
energies can be determined. To be specific, we shall only consider three important
structures, CP, CA, and CH, for the $ABC_2$ alloys, and define an excess energy $\Delta E$ as

$$\Delta E = E(ABC_2) - [E(AC) + E(BC)], \qquad (1)$$

where $E(ABC_2)$ is the energy per molecule, or per four atoms, in the alloy $ABC_2$, and
$E(AC)$ and $E(BC)$ are the energies per pair of atoms in the AC and BC zinc-blende
compounds, respectively. If these ordered alloys are in their thermodynamically stable
states at the growth temperature, $\Delta E$ has to be negative and must have a magnitude
considerably larger than 200 meV on the present scale. This would be in contradiction with
the positive values of $\Delta E$ previously reported for the bulk semiconductor
alloys.’'"^'** However, the ordered semiconductor alloys found from the epitaxial growth
may be in a constrained equilibrium state, where the constraint is imposed by the substrate
strain. We need to know the energetics of the various states involved besides the bulk
excess energy before we can start understanding the ordering and stability of these alloys.
The calculation of some of these energies along with an accuracy analysis
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TABLE  1.  A  list  of  the  ordered  Hl-V  semiconductor  alloys  identified  experimentally. 


Alloys     Structure    Growth method    Substrate                Temperature (°C)   Reference
AlGaAs2    CA           MOCVD and MBE    GaAs(100) and GaAs(110)  600-800            Ref. 7
AlInP2     CP           MOCVD            GaAs(001)                650-700            Ref. 13
AlInAs2    CP           MOCVD            InP(001)                 600                Ref. 9
GaInP2     CP           MOCVD            GaAs(001)                630                Ref. 14
           CP           MOCVD            GaAs(001)                640                Ref. 15
           CP           MOCVD            GaAs(001)                650-700            Ref. 13
           CP           MOCVD            GaAs(001)                600-630            Ref. 16
           CP           MOCVD            GaAs(001)                600-700            Ref. 17
GaInAs2    famatinite   LPE              InP(110)                 630                Ref. 8
           luzonite     MBE              InP(001)                 400                Ref. 10
           CA           MBE              InP(110)                 500                Ref. 11
           CP           VLE              InP(001)                 650-660            Ref. 12
Ga2AsSb    CA           MOCVD            InP(100)                 550-680            Ref. 18
           CH           MOCVD            InP(100)                 600                Ref. 18
           CP           MBE                                       540                Ref. 19

is  the  main  purpose  of  this  paper. 

The  excess  energies  for  a  number  of  ordered  semicon¬ 
ductor  alloys  have  been  calculated  from  local-density- 
functional  (LDF)  theory  using  various  band-structure 
methods. Although  the  LDF  error  for  the 
cohesive  energy  of  a  III-V  compound  is  typically  several 
tenths  of  one  eV  per  pair  of  atoms,  it  is  generally  believed 
that  the  excess  energy  based  on  the  same  technique  is  ac¬ 
curate  to  several  meV,  because  the  errors  in  LDF  cancel 
in  Eq.  (1).  However,  the  ordered  alloys  that  we  are  con¬ 
sidering  are  open  structures  containing  several  atoms  per 
unit  cell.  The  atomic  positions  in  these  alloys  are  usually 
distorted  away  from  the  regular  zinc-blende  sites.  Only 
the  most  sophisticated  band-structure  theories  which  are 
capable  of  treating  shear  distortion,  such  as  the  full- 
potential  linear  combination  of  muffin-tin  orbitals  (FP- 
LMTO),^^  full  potential  augmented  plane  waves  (FP- 
APW),^  and  the  fully  converged  plane-wave  pseudopo¬ 
tential  method,^’  can  be  expected  to  yield  precise  results 
within  LDF.  Even  with  present-day  computers,  it  is  still 
too  expensive  to  use  these  methods  to  perform  calcula¬ 
tions  over  a  wide  range  of  semiconductor  alloys.  On  the 
other hand, although the valence-force-field (VFF) model^' is simple and is effective in
treating the strain energy, it cannot account for the chemical energy.^* These
considerations  have  motivated  us  to  use  the  empirical 
tight-binding  (ETB)  method.  The  ETB  not  only  can  treat 
both  the  strain  and  chemical  energies  but  also  allows  for 
precise  and  systematic  computations.  To  eliminate  the 
propagation of errors from constituent compounds to alloys, the parameters in ETB are
adjusted to reproduce the experimental values for the cohesive energy, bond length d, bulk
modulus B, and the shear elastic constant $C_{11} - C_{12}$ for each constituent crystal.
These parameters are then used in the alloy calculation without further adjustment. Our
method thus corresponds to an interpolation scheme for the alloys between the constituent
compounds. Such an approach is particularly appropriate for
the  present  study,  because  the  alloys  and  the  constituent 
crystals  have  very  similar  structures  and  local  bonding. 

The  rest  of  this  paper  is  arranged  as  follows:  Section  II 
describes  the  ETB  model,  the  way  the  parameters  are 
determined,  and  the  results  for  the  structural  properties 
of  the  constituent  crystals.  Section  III  briefly  describes 
the  structural  parameters  and  the  energy-minimization 
procedure  for  the  three  alloy  structures  in  both  their  bulk 
equilibrium  states  and  in  states  constrained  to  match  sub¬ 
strates.  The  calculated  excess  energies  are  summarized  in 
Sec.  IV  and  are  compared  with  those  from  LDF  and 
VFF. To provide more detailed structural information,
the  calculated  bond  lengths  and  bulk  moduli  of  the  alloys 
are  also  presented.  The  final  section.  Sec.  V,  contains  a 
summary  and  discussion. 


II. TIGHT-BINDING MODEL

The tight-binding (TB) model that we are going to use is very similar to that used by
Chadi“ and Harrison.”’** The total energy of a semiconductor crystal is assumed to be the
sum of the electron energies $E_n(\mathbf{k})$ in the valence bands and the pair repulsive
energies $u_{ij}$ between the nearest-neighbor atoms:

$$E_T = E_{BS} + U_T = 2\sum_{n,\mathbf{k}} E_n(\mathbf{k}) + \sum_{i>j} u_{ij} . \qquad (2)$$

Furthermore, the band energies are calculated using a minimum-basis TB Hamiltonian which
includes one s and three p orbitals per atom. The interaction parameters needed for the
Hamiltonian are the term values $\epsilon_s$ and $\epsilon_p$ of each atom and the
nearest-neighbor two-center interactions $V_{ss\sigma}$, $V_{sp\sigma}$, $V_{pp\sigma}$,
and $V_{pp\pi}$. For a given crystal structure, the Hamiltonian $H(\mathbf{k})$ associated
with a given wave vector $\mathbf{k}$ within the first Brillouin zone has a dimension of 4m,
where m is the number of atoms per unit cell. The diagonal part of $H(\mathbf{k})$ consists
of the term values, and the off-diagonal elements are computed as

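$$H_{\gamma\gamma'}(\mathbf{k}) = \sum_{\mathbf{d}_{\gamma\gamma'}} e^{i\mathbf{k}\cdot\mathbf{d}_{\gamma\gamma'}}\, h_{\gamma\gamma'}(\mathbf{d}_{\gamma\gamma'}) , \qquad (3)$$

where the sum runs over the first-neighbor bond displacements $\mathbf{d}_{\gamma\gamma'}$
that point from the orbitals denoted $\gamma$ to $\gamma'$. The $\gamma$ stands for the s,
$p_x$, $p_y$, or $p_z$ orbital of a particular atom in a given unit cell. The matrix elements
$h_{\gamma\gamma'}$ are related to the two-center interactions by the Slater-Koster
relations,*'

$$h_{ss} = V_{ss\sigma} , \qquad (4)$$

$$h_{sp_i} = a_i V_{sp\sigma} , \qquad (5)$$

$$h_{p_ip_i} = a_i^2 V_{pp\sigma} + (1 - a_i^2) V_{pp\pi} , \qquad (6)$$

$$h_{p_ip_j} = a_i a_j (V_{pp\sigma} - V_{pp\pi}) \quad (i \neq j) , \qquad (7)$$

where $a_i = x_i/d$ are the direction cosines of $\mathbf{d}$, while all the V's depend only
on the length d.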
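
The Slater-Koster relations can be made concrete with a short sketch that builds the 4 x 4 block of bond matrix elements for one nearest-neighbor displacement. It assumes the standard two-center forms h_ss = V_ss-sigma, h_s,p_i = a_i V_sp-sigma, h_p_i,p_i = a_i^2 V_pp-sigma + (1 - a_i^2) V_pp-pi, and h_p_i,p_j = a_i a_j (V_pp-sigma - V_pp-pi). The function name, the (s, px, py, pz) ordering, and the use of a single V_sp-sigma for both atoms of the bond are choices made for this illustration only.

```python
import math


def slater_koster_block(d_vec, v_sss, v_sps, v_pps, v_ppp):
    """Return the 4x4 block h[g][g'] for one bond with displacement d_vec.

    Rows index orbitals (s, px, py, pz) on the first atom and columns index
    the same orbitals on the second atom. A single V_sp-sigma is assumed for
    both s-p orderings, which is a simplification for heteropolar bonds.
    """
    length = math.sqrt(sum(x * x for x in d_vec))
    a = [x / length for x in d_vec]          # direction cosines a_i = x_i / d
    h = [[0.0] * 4 for _ in range(4)]
    h[0][0] = v_sss                          # s-s element
    for i in range(3):
        h[0][i + 1] = a[i] * v_sps           # s on first atom, p_i on second
        h[i + 1][0] = -a[i] * v_sps          # p_i on first atom, s on second (odd parity)
        for j in range(3):
            if i == j:
                h[i + 1][j + 1] = a[i] ** 2 * v_pps + (1.0 - a[i] ** 2) * v_ppp
            else:
                h[i + 1][j + 1] = a[i] * a[j] * (v_pps - v_ppp)
    return h
```

For a bond along x the p block is diagonal, with V_pp-sigma for the px-px element and V_pp-pi for the other two. For any bond direction the trace of the p block equals V_pp-sigma + 2 V_pp-pi, which is a convenient sanity check.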

The first task is to determine the forms for the interactions $V_{ll'm}$ and the repulsive
energy u from the constituent compounds. Since the strain energy plays a very important role
in the alloy formation energy, we shall make sure that our model produces the correct elastic
constants. To keep the model close to Harrison’s”-*® original form, but to free it from his
rigid $1/d^2$ and $1/d^4$ scaling rules for $V_{ll'm}$ and u, respectively, we assume the
following forms:

$$V_{ll'm}(d) = V^0_{ll'm}\,(d_0/d)^n \qquad (8)$$

and

$$u(d) = u_0\,(d_0/d)^m , \qquad (9)$$

where the superscript and subscript 0 indicate the values evaluated at the equilibrium bond
length $d_0$. The values of $V^0_{ll'm}$ are taken to be Harrison's universal forms scaled
by a factor f,

$$V^0_{ll'm} = f\,\eta_{ll'm}\,\frac{\hbar^2}{m d_0^2} . \qquad (10)$$

Note that Harrison’s*® universal two-center interactions take the form

$$V_{ll'm} = \eta_{ll'm}\,\frac{\hbar^2}{m d^2} ,$$

where m is the free-electron mass, $\hbar$ is Planck's constant divided by $2\pi$, and the
$\eta$'s take the following values: $\eta_{ss\sigma} = -1.32$, $\eta_{sp\sigma} = 1.42$,
$\eta_{pp\sigma} = 2.22$, and $\eta_{pp\pi} = -0.63$.

Thus there are four adjustable parameters for each system: the scaling parameter f, the
powers n and m, and the value $u_0$. These parameters are determined by requiring that the
model produce the correct experimental values for the bond energy $E_{bond}$, $d_0$, B, and
the shear elastic constant $C_{11} - C_{12}$. Since $C_{11} - C_{12}$ is governed only by
$V_{ll'm}$ in the present model, it alone determines the scaling factor f. Then the bond
energy $E_{bond}$ is used to determine $u_0$. The requirement that the first derivative of
$E_T$ be zero at $d_0$ then determines the ratio of the powers n/m, which couples with the
equation for the bulk modulus to yield the values for n and m. One can then use these sets
of parameters to calculate other quantities that are not employed in the fitting, e.g.,
another shear elastic constant $C_{44}$, the Kleinman internal-displacement parameter*’
$\zeta$, and the optical-phonon frequencies $\omega$ at the zone center, to check the
validity of the model.
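
The scaling forms can be illustrated with a minimal sketch that evaluates Harrison's universal interactions and the power-law forms V(d) = f V_H(d0) (d0/d)^n and u(d) = u0 (d0/d)^m. The value hbar^2/m = 7.62 eV Å^2 is the standard free-electron constant, and the GaAs numbers in the example (d0 = 2.448 Å, f = 1.397, n = 3.633, m = 5.716, u0 = 6.530 eV) are read from Tables II and III. The dictionary keys and function names are invented for this example.

```python
HBAR2_OVER_M = 7.62  # hbar^2 / m for the free-electron mass, in eV * Angstrom^2

# Harrison's universal dimensionless coefficients quoted in the text.
ETA = {"ss_sigma": -1.32, "sp_sigma": 1.42, "pp_sigma": 2.22, "pp_pi": -0.63}


def harrison_v(kind, d):
    """Harrison's universal two-center interaction eta * hbar^2 / (m d^2), in eV."""
    return ETA[kind] * HBAR2_OVER_M / d ** 2


def scaled_v(kind, d, d0, f, n):
    """Interaction f * V_Harrison(d0) * (d0 / d)^n, in eV."""
    return f * harrison_v(kind, d0) * (d0 / d) ** n


def repulsion(d, d0, u0, m):
    """Pair repulsive energy u0 * (d0 / d)^m, in eV."""
    return u0 * (d0 / d) ** m


if __name__ == "__main__":
    d0, f, n, m, u0 = 2.448, 1.397, 3.633, 5.716, 6.530  # GaAs, Tables II and III
    for kind in ETA:
        print(kind, round(scaled_v(kind, d0, d0, f, n), 3), "eV at d0")
    # Effect of a 1% bond compression on the repulsion.
    print("u(0.99 d0) =", round(repulsion(0.99 * d0, d0, u0, m), 3), "eV")
```

Setting f = 1 and n = 2 recovers Harrison's original 1/d^2 rule, which makes it easy to compare the fitted model against the universal one for the same bond length.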

In the actual calculations we used the term values tabulated by Chen and Sher.*’ Table II
lists the experimental values^“^’ for the $d_0$, $E_{bond}$, B, and $C_{11} - C_{12}$ used
to fit the parameters, and the values of $C_{44}$ and transverse phonon frequency $\omega$
used for the consistency check.

TABLE II. Values of bond length d, bond energy $E_{bond}$, bulk modulus B, and shear
coefficient $C = C_{11} - C_{12}$ used to determine the parameters in Table III. Also listed
are the experimental values of $C_{44}$ and the TO-phonon mode $\omega$ at $\Gamma$ to be
compared with the calculations. All the elastic constants are in units of $10^{11}$
dyn/cm$^2$, d in Å, $E_{bond}$ in eV, and $\omega_{TO}$ in terms of wave numbers in
cm$^{-1}$. The sources of these values are discussed in Ref. 47. Also listed are the force
constants $\alpha$ and $\beta$ (N/m) for the valence-force-field model in Eq. (13).

        d       E_bond   B        C        C44      ω      α        β
AlP     2.367   -2.13    8.600    6.900    6.150    440    43.867   9.429
AlAs    2.431   -1.89    7.727    7.160    5.420    361    40.360   10.132
AlSb    2.656   -1.76    5.817    4.428    4.076    366    33.417   6.790
GaP     2.360   -1.78    9.143    7.870    7.143    367    46.257   10.723
GaAs    2.448   -1.63    7.690    6.630    6.040    269    40.351   9.371
GaSb    2.640   -1.48    5.792    4.946    4.440    231    32.800   7.539
InP     2.341   -1.74    i.i*n    4.460    4.600    304    40.346   6.543
InAs    2.622   -1.35    5.794    3.803    3.959    219    33.165   5.757
InSb    2.803   -1.40    4.831    3.130    3.132    185    29.605   5.069
ZnTe    2.637   -1.20    5.090    3.060    3.120    177    29.445   4.659
CdTe    2.806   -1.10    4.210    1.680    2.040    141    26.374   2.722
HgTe    2.798   -0.81    4.759    1.817    2.259    116    29.773   2.935

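The elastic constants can be calculated directly from perturbation theory. First the
Hamiltonian $H(\mathbf{k})$ is expanded in powers of the infinitesimal strain parameter
$\epsilon$ up to second order:

$$H = H_0 + H_1\epsilon + \tfrac{1}{2}H_2\epsilon^2 , \qquad (11)$$

where $H_0$ is the strain-free Hamiltonian and $H_1$ and $H_2$ are, respectively, the first
and second derivatives with respect to $\epsilon$ evaluated at $\epsilon = 0$. The
band-energy contribution to the strain coefficient then comes from the second derivative of
$E_{BS}$ with respect to $\epsilon$, denoted by

$$\frac{\partial^2 E_{BS}}{\partial\epsilon^2} = 2\sum_{v,\mathbf{k}} \langle v,\mathbf{k}|H_2|v,\mathbf{k}\rangle + 4\sum_{v}\sum_{c}\sum_{\mathbf{k}} \frac{|\langle v,\mathbf{k}|H_1|c,\mathbf{k}\rangle|^2}{E_v(\mathbf{k}) - E_c(\mathbf{k})} , \qquad (12)$$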


where $E_c(\mathbf{k})$ and $|c,\mathbf{k}\rangle$ are, respectively, the eigenenergies and
eigenvectors of $H_0$ for the conduction bands. Similarly, $v,\mathbf{k}$ indexes the
valence bands. Note that the inter-valence-band contributions in the second-order
perturbation sum cancel exactly and so they are not needed in Eq. (12). The matrix elements
of $H_1$ and $H_2$ needed here can be expressed from Eqs. (3)-(7) in terms of the first and
the second strain derivatives of the two-center interactions $V_{ll'm}$ and the direction
cosines $a_i$.

Table III shows the results for f, n, m, and $u_0$ obtained for the constituent compounds,
and the corresponding values of $C_{44}$, $\zeta$, and $\omega$ calculated as a consistency
check. The scaling factor f ranges from 1.1 to 1.5 and tends to decrease with an increase in
polarity. In the power-law dependence $V_{ll'm} \propto (d_0/d)^n$, n ranges from 3.3 to
4.2, which is consistently larger than the n = 2 used in Harrison’s universal TB parameters.
For the repulsive pair energy $u = u_0(d_0/d)^m$, the power m ranges from 5.4 to 7.1. The
ratio m/n falls in the range from 1.5 to 1.9, which is smaller than the m/n = 2 used by
Harrison.’’ The calculated values of $C_{44}$ for most systems agree with experiment to 10%
or better. The calculated TO-phonon modes at $\Gamma$ for most III-V systems also agree with
experiments to 10% or better. The discrepancies for the II-VI systems are larger (about
15%). Reliable results for $\zeta$ from both experiments and first-principles calculations
are only available for a limited number of semiconductors. The calculated $\zeta$,
$C_{44}$, and $\omega$ from the TB model agree very well with those results, as shown in
Table IV.

The results in Tables III and IV are based on a particular set of term values and TB
parameters. It is useful to examine how the predictions are influenced by these parameters
and the fitting procedure. Table V shows a result based on Chadi's procedure’* ’* in which
the TB matrix elements are scaled as $1/d^2$, and the repulsive pair energy is taken to be

$$u = u_0 + u_1(d - d_0) + u_2(d - d_0)^2 .$$

The parameter $u_0$ is set to produce the correct bond energy, $u_1$ is determined by
requiring the correct equilibrium bond length, and $u_2$ is fixed by the bulk modulus. Two
sets of TB parameters are tabulated for each system: One of them is the set used by Chadi’*
and the other (labeled present work) is the set obtained by multiplying Harrison’s
$V_{ll'm}$ by the scaling factor f listed in Table III. For convenient comparison, the zero
of the term values is set equal to the anion s energy. Despite considerable differences in
these two sets of TB parameters, the results of the predictions from the two sets are very
similar and also very similar to those predicted from the other procedure given in Table IV.
The only noticeable difference


TABLE III. The results for the parameters f, n, m, and $u_0$ obtained from the fitting of
the bond energy, bond length, bulk modulus, and shear coefficient $C_{11} - C_{12}$ of
Table II using the full band-structure calculation. Also listed are the calculated
$C_{44}$, internal-displacement parameter $\zeta$, and the TO-phonon mode $\omega$ at
$\Gamma$. All the elastic constants are in units of $10^{11}$ dyn/cm$^2$, $u_0$ is in eV,
and $\omega$ are given in terms of wave numbers in cm$^{-1}$.

        f       n       m       u0      C44     ζ       ω
AlP     1.294   3.530   5.598   6.435   5.827   0.516   447
AlAs    1.464   3.524   5.430   7.089   5.598   0.459   384
AlSb    1.337   3.268   5.668   4.838   3.944   0.564   354
GaP     1.395   3.705   5.683   7.285   6.857   0.501   382
GaAs    1.397   3.633   5.716   6.530   5.791   0.500   292
GaSb    1.431   3.471   5.717   5.519   4.515   0.536   256
InP     1.323   4.240   6.633   5.603   4.260   0.584   304
InAs    1.300   3.997   6.427   4.962   3.564   0.552   220
InSb    1.353   3.773   6.399   4.350   3.092   0.602   200
ZnTe    1.284   3.306   5.828   4.285   2.813   0.590   205
CdTe    1.171   3.656   6.761   3.092   1.701   0.694   156
HgTe    1.173   3.760   7.074   3.080   2.040   0.716   152
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between  the  predictions  in  Table  IV  and  Table  V  is  that 
in  the  latter  the  phonon  frequencies  are  slightly  larger 
and  C44  are  slightly  smaller.  These  comparisons  show 
that  the  model  with  parameters  given  in  Table  II  not  only 
contains  the  correct  structure  properties  inserted  through 
the  fitting  procedure,  but  also  predicts  other  mechanical 
properties  with  reasonable  accuracy.  This  model  should 
serve  as  a  good  basis  for  alloy  calculations. 
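
The ranges quoted for n, m, and the ratio m/n can be checked directly against the fitted powers. The short sketch below does so using the n and m values as they are read from Table III; the dictionary layout and function name are illustrative only.

```python
# (n, m) pairs as read from Table III for each constituent compound.
POWERS = {
    "AlP": (3.530, 5.598), "AlAs": (3.524, 5.430), "AlSb": (3.268, 5.668),
    "GaP": (3.705, 5.683), "GaAs": (3.633, 5.716), "GaSb": (3.471, 5.717),
    "InP": (4.240, 6.633), "InAs": (3.997, 6.427), "InSb": (3.773, 6.399),
    "ZnTe": (3.306, 5.828), "CdTe": (3.656, 6.761), "HgTe": (3.760, 7.074),
}


def power_ratios(powers):
    """Return {compound: m / n} for the fitted power laws."""
    return {name: m / n for name, (n, m) in powers.items()}


if __name__ == "__main__":
    ratios = power_ratios(POWERS)
    for name, ratio in ratios.items():
        print(f"{name:5s} m/n = {ratio:.3f}")
    print("min m/n:", round(min(ratios.values()), 3))
    print("max m/n:", round(max(ratios.values()), 3))
```

All twelve ratios fall below Harrison's m/n = 2, with the II-VI tellurides at the upper end of the range.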

III.  ALLOY  CALCULATION 
A. CuAu I structure (CA)

The $ABC_2$ semiconductor alloy in the CA structure forms a layer structure ACBCACBC ···
along the (001) direction. The basic lattice vectors can be chosen as
$\mathbf{a}_1 = (\tfrac{1}{2}, \tfrac{1}{2}, 0)a$, $\mathbf{a}_2 = (\tfrac{1}{2}, -\tfrac{1}{2}, 0)a$,
and $\mathbf{a}_3 = (0, 0, \beta)a$, where $\beta$ is the c/a ratio with a and c being the
lattice parameters. Note that the ideal $\beta$ value for a zinc-blende structure is 1.
There are four atoms per unit cell: one A atom at (0,0,0), one B atom at
$(\tfrac{1}{2}, 0, \beta/2)a$, and two C atoms at $(1, 1, \beta+\delta)a/4$ and
$(3, 1, 3\beta-\delta)a/4$, where $\delta$ is


TABLE IV. Comparison between calculated and experimental lattice constant a, elastic
constants B, $C_{11} - C_{12}$, and $C_{44}$, Kleinman (Ref. 42) internal-displacement
parameter $\zeta$, and the TO-phonon frequency $\omega$ in wave numbers cm$^{-1}$. Also
listed are $C_{44}^{(0)}$, which correspond to the value without the internal distortion.
The FP-LMTO and PP-PW are the ab initio theories, and TB is our tight-binding method
discussed in the text. All elastic constants are in units of $10^{11}$ dyn/cm$^2$.
Experimental values are those listed in Table II.


Expt. 

FP-LMTO* 

PP-PW* 

TB 

Si 

a 

3.431 

3.41 

5.45 

5.431 

B 

9.923 

9.9 

9.3 

9.923 

Cm  “C|2 

10.274 

10.2 

9.8 

10.274 

Cm 

8.036 

8.3 

8.3 

8.013 

^44 

II. 1 

11.30 

e 

0.54' 

0.51 

0.33 

0.51 

Of 

523 

318 

321 

572 

Ge 

a 

3.63 

5.39 

5.65 

B 

7.653 

7.2 

7.653 

Cm  “Cii 

8.189 

8.3 

8.189 

C44 

6.816 

6.3 

6.84 

^44 

7.7 

9.46 

S 

0.44 

0.49 

0} 

303 

302 

342 

GaAs 

a 

3.642 

5.55 

3.642 

B 

7.69 

7.3 

7.69 

Cm  ~C|2 

6.63 

7.0 

6.63 

^44 

6.04 

6.2 

5.791 

/'(Oi 

^44 

7.5 

7.83 

0.48 

0.50 

6) 

273 

268 

292 

a Reference 33.
b Reference 49.
c Reference 50.


the internal distortion parameter for the C atoms. There are only two different
nearest-neighbor bond lengths in the alloy:

$$d_{AC} = (a/4)\,[2 + (\beta + \delta)^2]^{1/2}$$

and

$$d_{BC} = (a/4)\,[2 + (\beta - \delta)^2]^{1/2} .$$

For a set of values for a, $\beta$, and $\delta$, the total energy is calculated following
the general description in Sec. II. The interactions $V_{ll'm}$ and the repulsive energy u
are scaled by the bond lengths according to Eqs. (8) and (9), respectively. The Hamiltonian
$H(\mathbf{k})$ is now a 16×16 matrix. The total energy is then minimized by varying the
three parameters a, $\beta$, and $\delta$. If the alloy is constrained to match a (001)
substrate, the lattice constant a is fixed by the substrate, and the total energy is
minimized with respect to $\beta$ and $\delta$.
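
The two bond lengths of the CA structure follow directly from the atomic positions, and a short sketch can confirm the closed forms. It assumes the A atom at the origin, the B atom at (1/2, 0, β/2)a, and a C atom at (1, 1, β+δ)a/4, and compares the resulting distances with (a/4)[2 + (β ± δ)^2]^(1/2). The function names and the sample numbers are chosen for this illustration only.

```python
import math


def ca_bond_lengths_from_positions(a, beta, delta):
    """Return (d_AC, d_BC) measured between the assumed atomic positions."""
    pos_a = (0.0, 0.0, 0.0)
    pos_b = (0.5 * a, 0.0, 0.5 * beta * a)
    pos_c = (0.25 * a, 0.25 * a, 0.25 * (beta + delta) * a)
    return math.dist(pos_a, pos_c), math.dist(pos_b, pos_c)


def ca_bond_lengths_closed_form(a, beta, delta):
    """Return (d_AC, d_BC) from (a/4) * sqrt(2 + (beta +/- delta)^2)."""
    d_ac = 0.25 * a * math.sqrt(2.0 + (beta + delta) ** 2)
    d_bc = 0.25 * a * math.sqrt(2.0 + (beta - delta) ** 2)
    return d_ac, d_bc


if __name__ == "__main__":
    # Illustrative numbers only: a in Angstrom, small tetragonal and internal distortions.
    print(ca_bond_lengths_from_positions(5.65, 1.01, 0.02))
    print(ca_bond_lengths_closed_form(5.65, 1.01, 0.02))
```

With β = 1 and δ = 0 both bond lengths reduce to the zinc-blende value a√3/4, and a positive δ lengthens the AC bond while shortening the BC bond.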

B.  Chalcopyrite  structure  (CH) 

The  ABC  2  semiconductor  alloy  in  the  CH 
structure  forms  a  superlayer  structure 
ACACBCBCACACBCBC  ■  ■  ■  along  the  (012)  direc¬ 
tion.  The  basic  lattice  vectors  can  be  chosen 
as  a,  =  (l,l,-2^)fl/2,  a2=(-l,l,2P)fl/2,  and 
a3=(l,  — l,2^)a/2,  where  P  again  is  the  c/a  ratio,  with 
an  ideal  value  of  1.  There  are  now  eight  atoms  per  unit 
cell:  two  A  atoms  at  (0,0,0)  and  (0,1  j9)o/2,  two  B  atoms 
at  (l,Oj3)a/2  and  (l,l,0)a/2,  and  four  C  atoms  at 
(l+b,\,P)a/4,  (l,3+8,3/?)a/4,  (3,1 -8.3^)a/4,  and 
(3— 6,3,^)a/4,  where  6  is  the  internal  distortion  parame¬ 
ter  for  the  C  atoms.  Again  there  are  also  only  two 
different  nearest-neighbor  bond  lengths  in  the  alloy: 

and 

A similar procedure can now be carried out to minimize
the total energy with respect to the three parameters $a$, $\beta$,
and $\delta$. However, the Hamiltonian $H(\mathbf{k})$ is now $32\times32$.
For the case in which the lattice is matched to the (001)
substrate, we again are left with two parameters, $\beta$ and $\delta$,
to vary for the energy minimization.
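The two chalcopyrite bond lengths can be worked out directly from the atomic positions quoted above. The short derivation below is a sketch under that reading of the coordinates (with $\beta$ the c/a ratio and $\delta$ the internal distortion); it is not a transcription of the printed expressions.

```latex
% C atom at (1+\delta, 1, \beta)a/4; neighbouring A atom at (0,0,0),
% neighbouring B atom at (1,1,0)a/2 = (2,2,0)a/4.
\begin{align}
  d_{AC} &= \frac{a}{4}\,\bigl[(1+\delta)^2 + 1 + \beta^2\bigr]^{1/2}, \\
  d_{BC} &= \frac{a}{4}\,\bigl[(1-\delta)^2 + 1 + \beta^2\bigr]^{1/2}.
\end{align}
% Check: for the ideal values \beta = 1, \delta = 0 both reduce to the
% zinc-blende bond length \sqrt{3}\,a/4.
```

The other neighbours of the same C atom, the A atom at $(0,1,\beta)a/2$ and the B atom at $(1,0,\beta)a/2$, give the same two lengths, which is why only two distinct bond lengths occur.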

C. CuPt structure (CP)

In the CP structure, the alloy forms a (111) superlattice
ACBCACBC... . Because of a lack of reflection
symmetry about any of these planes, the B layer need not
be located exactly at the middle position between the two
successive A layers. Also, the distance between two
closest atoms from two different A layers need not
correlate with that between two A atoms in the same
plane. Thus a total of five independent parameters are
required to describe the crystal structure: the lateral
lattice constant $a$ for the layers, the spacing $D$ between
two successive A layers, and the three spacing parameters
for the three layers (one B and two C) inside $D$.
There are, however, only four atoms per unit cell, each
coming from a different layer, so the $H(\mathbf{k})$ matrix is
$16\times16$. There are also four different bond lengths in the
crystal, two for the AC and two for the BC bonds. If the
alloy lattice is matched to a (111) substrate, the lateral
lattice parameter is fixed, but we still have four parameters
to vary for energy minimization. However, if the alloy
is matched to a (001) substrate, we shall assume that
all the alloying atoms A and B are locked into the fcc lattice
points of the substrate, and we are left with only two
free parameters, which describe the relaxation of the two
nonequivalent C layers.

TABLE V. Comparison between the two different sets of TB parameters described in the text: the resultant coefficients $u_0$, $u_1$, and
$u_2$ of $u$, and the predicted elastic constants, Kleinman internal-displacement parameters $\zeta$, and phonon frequency $\omega$.

It is useful to comment on the Brillouin-zone (BZ) integration
needed for the calculation of the electronic energy.
Since our calculation involves relatively small and
easily handled matrices, we are able to sample over a
large number of k points. We found that a uniform grid
of 1000 k points inside the BZ always guarantees a convergence
of the total energy per pair of atoms to within 10“’
meV and the elastic constants to an accuracy in the third
digit. We found that total energies calculated using two
special k points are about 10 meV per atom pair higher
than the converged value. However, when extending the
two-special-k-point method to the CA and CH structures,
most of the errors of the alloy and pure compound
cancel, and we found that the final errors in the
excess energies are only about 0.5 meV per atom pair.

Before presenting the results, we briefly describe the
Keating valence-force-field (VFF) model. The VFF
deals only with the strain energy. The energy per unit
volume in a strained diamond or zinc-blende crystal is
given by


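$U = \frac{3}{8d_0^2}\sum_i \alpha_i\,[\Delta(\mathbf{d}_i\cdot\mathbf{d}_i)]^2 + \frac{3}{8d_0^2}\sum_i\sum_{j>i} \beta_{ij}\,[\Delta(\mathbf{d}_i\cdot\mathbf{d}_j)]^2$ , (13)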


where in the first term, the bond-stretching energy, the
summation over $i$ runs over all the bonds, and in the second
term, the bond-angle contribution, the summations include
all the pairs of bonds that share common atoms.
The $d_0$ in Eq. (13) is the equilibrium bond length, and
$\Delta(\mathbf{d}_i\cdot\mathbf{d}_j)$ is the strain-induced change of the dot product
between the two bond vectors which point from the common
atom to the nearest-neighbor atoms. For a zinc-blende
crystal, there is only one value for the bond-stretching
force constants, $\alpha_i=\alpha$, and one value for the
bond-angle-restoring-force constant, $\beta_{ij}=\beta$. Their values
are determined by fitting the experimental bulk moduli
$B$ and shear elastic constants $C_{11}-C_{12}$, and are tabulated
in Table II. We extend Eq. (13) to calculate the strain energy
in an ABC2 alloy by treating each bond and each
pair of the same bonds in the same way as in the constituent
compounds. However, when dealing with the bond-angle
term involving two unlike AC and BC bonds, both
$d_0^2$ and $\beta_{ij}$ are taken to be the average values. The results
from VFF will be compared with the ETB calculations in
Sec. IV.
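As an illustration of how a Keating-type energy of the form of Eq. (13) is evaluated, the following minimal sketch sums the bond-stretching and bond-angle terms over an explicit list of bond vectors. The function name, the argument layout, and the numerical values of the force constants are assumptions made for this example; they are not taken from the paper or from Table II.

```python
import math

def dot(u, v):
    """Dot product of two 3-vectors given as sequences."""
    return sum(x * y for x, y in zip(u, v))

def keating_energy(bonds, bonds0, pairs, alpha, beta, d0):
    """Strain energy in the form of Eq. (13) for an explicit list of bonds.

    bonds, bonds0 : strained and unstrained bond vectors (same ordering)
    pairs         : (i, j) index pairs of bonds that share a common atom
    alpha         : bond-stretching constants alpha_i, one per bond
    beta          : bond-angle constants beta_ij, one per entry of `pairs`
    d0            : equilibrium bond length
    """
    prefactor = 3.0 / (8.0 * d0 ** 2)
    # Bond-stretching term: change of d_i . d_i for every bond.
    stretch = sum(
        a * (dot(d, d) - dot(d_ref, d_ref)) ** 2
        for a, d, d_ref in zip(alpha, bonds, bonds0)
    )
    # Bond-angle term: change of d_i . d_j for every pair sharing an atom.
    bend = sum(
        b * (dot(bonds[i], bonds[j]) - dot(bonds0[i], bonds0[j])) ** 2
        for b, (i, j) in zip(beta, pairs)
    )
    return prefactor * (stretch + bend)

# Hypothetical test case: the four tetrahedral bonds around one atom,
# uniformly dilated by a strain eps (arbitrary units throughout).
d0 = 1.0
s3 = d0 / math.sqrt(3.0)
tetra = [(s3, s3, s3), (s3, -s3, -s3), (-s3, s3, -s3), (-s3, -s3, s3)]
pairs = [(i, j) for i in range(4) for j in range(i + 1, 4)]
alpha_val, beta_val, eps = 1.0, 0.2, 0.01
strained = [tuple((1.0 + eps) * c for c in d) for d in tetra]
energy = keating_energy(strained, tetra, pairs,
                        [alpha_val] * 4, [beta_val] * 6, d0)
```

For this uniform dilation the sum can be done by hand: with $s=(1+\epsilon)^2-1$, the result is $d_0^2 s^2(1.5\alpha+0.25\beta)$, which gives a quick check on the bookkeeping of bonds and bond pairs.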


IV.  DISCUSSION  OF  ALLOY  RESULTS 
A.  Excess  energies 

Before  presenting  the  results  for  different  kinds  of  ex¬ 
cess  energies,  the  readers  should  be  reminded  of  the  rela¬ 
tion  between  these  energies  and  ordering.  The  Appendix 
provides  a  qualitative  discussion  of  this  relation.  It  also 
describes  how  different  kinds  of  excess  energies  presented 
below  may  alter  the  picture  of  ordering. 

The bulk excess energies $\Delta E$ calculated from ETB for
the ABC2 alloys in the three structures of CA, CH, and
CP are listed in Table VI along with the results from
VFF and LDF. First we observe that ETB and VFF give
very similar results. Both models produce very small $\Delta E$
values for those alloys composed of compounds with
nearly equal lattice constants. However, $\Delta E$ for AlGaP2
and HgCdTe2 derived from ETB are slightly negative.
The differences between the ETB and VFF results in
these systems are a measure of the small size of the
chemical-energy contribution to $\Delta E$ in the ETB model.
For the lattice-mismatched alloys, all the $\Delta E$ values from
ETB are positive and are slightly larger than those obtained
from VFF except for InAlP2. The major reason
VFF yields a smaller $\Delta E$ is that the VFF used here
consistently predicts smaller values of the elastic constant
$C_{44}$ than the experimental values, thereby underestimating
the strain contribution to $\Delta E$. However, the small
chemical energy included in ETB may upset this trend,
as exemplified by InAlP2.

The quantitative comparison between ETB and LDF is
mixed. Starting from the lattice-matched alloy
GaAlAs2, the $\Delta E$ values from LDF calculated by different
groups range from 7.5 to 35 meV for the three structures
considered, as compared to nearly zero calculated from
ETB. Similar differences between LDF and ETB also
occur in another lattice-matched system, HgCdTe2.
Since the strain energy is nearly zero in these systems, the
10-meV or so difference between the present calculation
and the majority of the LDA results represents the
discrepancy in the estimate of the chemical energy in $\Delta E$
between the two theories. These differences certainly are
well within the margins of error of both ETB and LDF.
However, we note that the LDA $\Delta E$ values for GaAlAs2 in
the CA structure are rather consistent except for the 35 meV
from Ref. 56. For the lattice-mismatched alloys, ETB
agrees very well (within 20 meV) with LDF for the III-V
alloys in the CP structure and II-VI alloys in the CH
structure. The agreement is also reasonable for all alloys
in the CA structure. However, the differences between
the two calculations are more substantial for the III-V
alloys in the CH structure and II-VI alloys in the CP
structure. We note that although the trend
$\Delta E_{CH} < \Delta E_{CA} < \Delta E_{CP}$ among the three structures holds
for both ETB and LDF for these lattice-mismatched alloys,
the $\Delta E_{CH}$ values from LDF for the III-V systems
are considerably lower than those from ETB, particularly
the negative values calculated for InAlP2 and InAlAs2.
Despite all these quantitative differences, the qualitative
trends of our ETB predictions are rather similar to those
from LDF by Wei and co-workers, as shown in Fig. 1.
The most important conclusion that can be drawn from
Table VI is that these ordered bulk alloys are not the
thermal equilibrium states at the experimental growth
temperatures shown in Table I. For this to happen, the
$\Delta E$ value has to be less than $-200$ meV per four atoms,
as mentioned earlier. This $\Delta E$ value is far below all the
calculated values and is beyond the uncertainties of our
ETB model and the LDF calculations listed above.

While the above excess energies do not support a
thermally stable ordered bulk alloy at the growth temperature
(≈600 °C), they may offer different conclusions
when applied to the epitaxially grown alloys. The idea
was first suggested by Flynn in connection with epitaxial
growth of ordered metal alloys. If the substrate material
serves as a reservoir for the epitaxial film, then the
alloys grown on the substrate have to be in thermal and
mechanical equilibrium with the substrate. Since most


substrates used in the growth are selected to have their
lattice constants close to the equilibrium lattice parameters
(see Tables I and IX) of the ordered alloys mentioned
above, it takes little strain energy for these alloys to
match the substrates. On the other hand, if an alloy on
the substrate is to decompose into two substrate
lattice-mismatched constituent compounds, it takes energy to
constrain the separated systems. Only the case of thin
layers in which no dislocations form will be treated.
Then the reference constituent energies $E(AC)$ and
$E(BC)$ in Eq. (1), which must be used to calculate the excess
energy for epitaxial growth, referred to as $\Delta E_{ep}$, are
the ones with their lattice parameters matched to the substrate.
This coherent strain can make $\Delta E_{ep}$ negative for
an alloy even when its bulk excess energy $\Delta E$ is positive.
Of course, the actual values of excess energies are substrate
specific. However, an overall assessment can be
made when the substrate is also an alloy by choosing the
substrate lattice constant to be the average value

$a_s = \bar{a} = (a^0_{AC} + a^0_{BC})/2$ .

The major difference between $\Delta E_{ep}$ and $\Delta E$ then comes
from the strain energies of the constituent compounds
forced to lattice match the substrate. Since the energies
of the ordered alloys at their equilibrium lattice constants
$a$ only differ by a small amount (1 or 2 meV) from those
at $\bar{a}$, we shall use the alloy energies already calculated in
Table VI to deduce the epitaxial excess energy.

The strain energy of either constituent compound
matched to $a_s=\bar{a}$ on the (001) substrate can be estimated
from the following simple formula:

$E_s(AC) = (C_{11} + 2C_{12} - 4C_{12}^2/C_{11})\,\delta_0^2 a^3/4$ , (14)

where $\delta_0$ is the percentage lattice-parameter difference
between the two constituent compounds.
The strain energies used to calculate the values
of $\Delta E_{ep}$ in Table VII, however, are obtained from ETB,
which in fact only produces a small correction (1 to 2


meV at most) to Eq. (14). The relative values of $\Delta E_{ep}$
among the three structures on the same substrate, say,
(001), should remain the same as those of $\Delta E$, because
these two energies only differ in the reference energy.
For example, the relation

$\Delta E_{ep}(\mathrm{CH}) < \Delta E_{ep}(\mathrm{CA}) < \Delta E_{ep}(\mathrm{CP})$

still holds for alloys composed of lattice-mismatched
constituent compounds. On the (001) substrate, the $\Delta E_{ep}$
values are essentially all positive in the CP, negative in
the CH, and switch between positive and negative values
in the CA structure. The CP alloys matched to the (111)
substrate have very small $\Delta E_{ep}$, with magnitudes smaller
than 10 meV. The LDF epitaxial energies calculated by
Wei, Ferreira, and Zunger have the same qualitative
trends as those from ETB among the three structures.
The quantitative differences in $\Delta E_{ep}$ between the two
calculations are larger and also more scattered than those in
$\Delta E$. This is peculiar, because the difference between $\Delta E$
and $\Delta E_{ep}$ in both calculations comes from the strain energies
of the constituent compounds, which can be reliably
estimated from Eq. (14) and are well prescribed by our
ETB model. Despite these discrepancies, a simple conclusion
can also be drawn from Table VII. These numbers
still cannot account for the ordering observed experimentally,
because all the calculated $\Delta E_{ep}$ are still well
above the $-200$-meV value required for the ordering at
the growth temperatures.

FIG. 1. The excess energies of Eq. (1) for the ordered alloys
in three crystal structures calculated from the present TB model
(solid circles) and from LDF in Ref. 6 (open circles).
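The statement that the bulk and epitaxial excess energies differ only in the reference energy can be checked against the tabulated numbers: the difference between the two should be the same for every structure of a given alloy. The sketch below does this for two alloys, using the ETB entries of Tables VI and VII for the (001) substrate as they are read from the source; the dictionary layout and the function name are illustrative choices.

```python
# ETB bulk excess energies (Table VI) and ETB epitaxial excess energies on
# the (001) substrate (Table VII), in meV per four atoms.
bulk = {
    "AlInP2": {"CA": 69.0, "CH": 44.0, "CP": 114.2},
    "GaInP2": {"CA": 88.4, "CH": 57.2, "CP": 139.4},
}
epitaxial = {
    "AlInP2": {"CA": -11.5, "CH": -36.3, "CP": 33.8},
    "GaInP2": {"CA": -3.3, "CH": -34.5, "CP": 47.6},
}

def reference_shift(alloy):
    """Bulk minus epitaxial excess energy for each structure of one alloy.

    If the two energies differ only in the reference (the strain energy of
    the substrate-matched constituents), the shift is structure independent.
    """
    return {s: bulk[alloy][s] - epitaxial[alloy][s] for s in bulk[alloy]}

shifts = {alloy: reference_shift(alloy) for alloy in bulk}
spreads = {
    alloy: max(vals.values()) - min(vals.values())
    for alloy, vals in shifts.items()
}
for alloy, vals in shifts.items():
    print(alloy, {s: round(v, 1) for s, v in vals.items()})
```

For both alloys the shift is constant across the three structures to within a few tenths of a meV (about 80 meV for AlInP2 and about 92 meV for GaInP2), which is why the ordering of the structures is preserved.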

In the above consideration, the constituent compounds
are constrained to match the lateral lattice of the substrate
and are allowed to relax fully in the growth direction.
For a (001) substrate with a lattice constant $a_s=\bar{a}$,
the c/a ratio for either compound is estimated to be

$c/a = 1 + \delta_0\,(C_{12}/C_{11} + 0.5)$ , (15)

where $\delta_0$ again is the percentage bond-length difference
between the two constituent compounds. This c/a relaxation
results in elongation for one compound and shrinkage
for the other along the growth direction. If the lattice
constants of two constituent crystals have a substantial
difference, these opposite relaxations will produce
strained grain boundaries between the AC and BC crystals.
The reference state energy $E(AC)+E(BC)$ used in


TABLE VI. The bulk excess energies $\Delta E$ (in meV per four atoms) from ETB and comparison with the results from
local-density-functional (LDF) theory and the valence-force-field (VFF) model.


Alloys

CuAu I

Chalcopyrite

CuPt

ETB 

LDF 

VFF 

ETB 

LDF 

VFF 

ETB 

LDF 

VFF 

AIGaP2 

-2.4 

1.5 

-2.8 

1.0 

-2.6 

0.2 

AIGaAs, 

0.6 

10.8* 

0.3 

0.6 

11.4'' 

0.2 

0.8 

7.5' 

0.4 

11.5' 

9.8' 

15.l‘ 

D.S"* 

35' 

AlInPj 

69.0 

43.0' 

74.0 

44.0 

-21.0' 

47.3 

114.2 

97.0' 

111.1 

AIInAs2 

68.8 

35.0' 

66.6 

45.8 

-15.0' 

43.7 

107.0 

97.6 

Ga2PAs 

30.0 

26.6* 

23.1 

19.4 

6.5* 

15.0 

33.6 

37.2* 

32.7 

Ga2PSb 

260.2 

207.0 

67.4 

135.8 

290.8 

292.6 

Ga2AsSb 

II3.0 

129.2* 

91.8 

69.8 

52.0' 

59.9 

128.0 

132.0* 

130.1 

114.8'' 

115.0' 

GaInP2 

88.4 

11 5.6* 

83.8 

57.2 

19.0' 

54.3 

139.4 

155.4' 

124.9 

91.0' 

54.4‘ 

GaInAs2 

73.2 

60.1' 

67.3 

48.2 

16.5» 

43.8 

113.0 

108.5» 

99.5 

83.6' 

66.7' 

Ga!nSb2 

57.4 

51.5 

37.4 

33.2 

85.58 

76.2 

ZnCdTej 

34.3 

54.2« 

33.7 

22.4 

19.2' 

21. 1 

65.3 

103.5» 

56.0 

ZnHgTe2 

29.7 

42.5* 

32.1 

21.0 

11. 4» 

19.9 

54.9 

103.3' 

53.5 

CdHgTej 

-2.3 

12.1' 

0.61 

-2.7 

11. 3» 

0.4 

-2.7 

9.8' 

1.1 

•Reference  28. 
•’Reference  31. 
’Reference  27. 
'•Reference  55. 
'Reference  56. 
•Reference  53. 


•Reference  6. 
•'Reference  30. 
'Reference  32. 
'Reference  29. 
‘Reference  54. 
'Reference 74.

TABLE VII. The excess energy $\Delta E_{ep}$ (in meV per four atoms) for epitaxial alloys calculated from the present TB model and
comparison with the LDF results.


Structure 

CuAu  I 

Chalcopyrite 

CuPt 

substrate 

(100) 

(100) 

(111) 

(100) 

(100) 

(Alloy) 

TB 

LDF 

TB 

LDF 

TB 

TB 

LDF 

AIGaPj 

-2.9 

-3.1 

0.5 

-3.0 

AIGaAs- 

0.8 

0.7 

0.4 

0.2 

AllnP^ 

-11.5 

-36.0“ 

-36.3 

-0.2 

33.8 

-18.0* 

AlInAs, 

-4.6 

-27.8 

-7.4 

33.6 

Ga^PAs 

5.3 

4.1* 

-5.3 

-21* 

0.2 

8.9 

0.5* 

GajPSb 

40.5 

-52.2 

7.8 

71.1 

Ga^AsSb 

15.5 

35.0^ 

-27.6 

-28" 

-5.6 

30.5 

52.0" 

17.0* 

-45.8* 

11.5' 

48.(y 

GalnP] 

-3.3 

-3.0* 

-34.5 

-106* 

-9.4 

47.6 

3.4* 

12.8' 

GalnAS] 

0.4 

-9.r 

-25.4 

-9.6 

39.5 

4.1* 

10.5" 

6.7" 

8.4" 

29.6' 

GalnSb2 

1.4 

-18.5 

-5.0 

29.9 

ZnCdTej 

-5.6 

-5.4' 

-17.5 

-57.2* 

6.6 

25.5 

0.2* 

ZnHgTej 

-5.9 

-13.5' 

-14.7 

-64.2‘ 

3.0 

19.3 

0.9* 

CdHgTe, 

-2.4 

-2.8 

-3.0 

-2.9 

•Reference  6. 
Reference  27. 
•Reference  28. 
"Reference  29. 


the calculation of $\Delta E_{ep}$ in Table VII assumed that the
decomposed AC and BC phases are macroscopic crystals.
The domain-wall energy contributions were neglected because
there are few boundaries. However, a realistic path
between the completely separated and fully relaxed AC
and BC domains is likely to pass through a sequence of
intermediate states, including the stage of forming microscopic
AC and BC clusters which serve as nucleation
centers. As an approximation to this phase space immediately
adjacent to the ordered ABC2 alloy, we have
estimated the metastable nucleation energy barrier by assuming
that the microscopic AC and BC clusters are lattice
matched to the ABC2 alloy and their energetics can
be estimated from the bulk crystals under the same constraint.
Because the c/a ratios for the ordered alloy are
nearly unity (see Table VIII), this epitaxial energy against
nucleation of AC and BC clusters is equivalent to using a
reference energy $E(AC)+E(BC)$ which disallows the
c/a relaxation. Consequently, this epitaxial excess energy
for the alloy, referred to as the hard-model excess energy, is
lowered further, as shown in Table VIII. Note that the
values for a number of alloys have already attained magnitudes
that could account for the stability of the observed
LRO of lattice-mismatched alloys. However, details
of the mechanism that causes the LRO to preferentially
form in the first place must still be determined.
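The tetragonal relaxation of Eq. (15) can be cross-checked against a direct elastic calculation: clamp a cubic crystal laterally to the substrate and let the growth-direction strain follow from the zero-stress condition, $\epsilon_\perp = -2(C_{12}/C_{11})\,\epsilon_\parallel$. The sketch below assumes that $\delta_0$ is the fractional lattice-constant difference measured relative to the mean $\bar{a}$, and it uses illustrative elastic constants that are not taken from the paper's tables.

```python
def c_over_a_eq15(delta0, c11, c12, larger=True):
    """c/a from Eq. (15); the sign of the correction is reversed for the
    compound with the smaller natural lattice constant."""
    sign = 1.0 if larger else -1.0
    return 1.0 + sign * delta0 * (c12 / c11 + 0.5)

def c_over_a_direct(a0, a_sub, c11, c12):
    """c/a of a cubic crystal (natural lattice constant a0) clamped
    laterally to a (001) substrate of lattice constant a_sub."""
    eps_par = a_sub / a0 - 1.0                 # in-plane strain
    eps_perp = -2.0 * (c12 / c11) * eps_par    # zero stress along growth
    return (1.0 + eps_perp) / (1.0 + eps_par)

# Illustrative inputs (assumed): a 7% mismatch and GaAs-like constants
# in units of 1e11 dyn/cm^2.
delta0, c11, c12 = 0.07, 12.0, 5.4
a_bar = 1.0                                    # substrate at the mean value
a_large = a_bar * (1.0 + delta0 / 2.0)
a_small = a_bar * (1.0 - delta0 / 2.0)

ratio_large = c_over_a_direct(a_large, a_bar, c11, c12)
ratio_small = c_over_a_direct(a_small, a_bar, c11, c12)
```

Under these assumptions the compound with the larger natural lattice constant is compressed in the plane and elongates along the growth direction (c/a > 1), while the smaller one shrinks (c/a < 1), in line with the opposite relaxations described above.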


B.  Bond  lengths 

In  addition  to  the  excess  energies,  our  calculations  also 
produce  detailed  information  about  the  equilibrium 
structures  of  the  alloys.  Table  IX  lists  the  c/a  ratio  and 


the first-neighbor bond lengths for all the alloys studied.
These results are particularly useful if experiments such
as EXAFS are carried out to measure the local structures
of these LRO alloys. As mentioned earlier, the c/a ratios
are nearly unity for all alloys. The notations used in the
table are such that the equilibrium bond lengths of the
constituent crystals are in the order $d^0_{AC} > d^0_{BC}$. The
bond lengths in the CA and CH structures are in general
bimodal, with $d_{AC} > d_{BC}$. This result is similar to that
found in the disordered bulk alloys. There are four
different bond lengths in the CP structure: $d^{S}_{AC}$, $d^{T}_{AC}$, $d^{S}_{BC}$,
and $d^{T}_{BC}$, where superscripts T and S mean the triple

TABLE VIII. The excess energy (in meV per four
atoms) for epitaxial alloys without (c/a)-ratio relaxation.

Alloy       CuAu I    Chalcopyrite    CuPt
AlGaP2        -3.1        -3.4         -3.3
AlGaAs2        0.7         0.6          0.2
AlInP2      -192.5      -217.4       -147.3
AlInAs2     -149.0      -172.2       -110.8
Ga2PAs       -37.5       -48.1        -33.9
Ga2PSb      -354.4      -447.1       -323.8
Ga2AsSb     -162.2      -205.2       -147.1
GaInP2      -200.4      -231.6       -149.5
GaInAs2     -149.8      -174.8       -109.9
GaInSb2     -113.9      -133.8        -85.3
ZnCdTe2     -131.6      -143.5       -100.5
ZnHgTe2     -150.3      -159.1       -125.1
CdHgTe2       -2.8        -3.2         -3.2

TABLE IX. The c/a ratios, the equilibrium bond lengths $d_{AC}$ and $d_{BC}$, and the equilibrium average lattice
constants $a$ (all lengths in Å). In the CP structure, the first values of $d_{AC}$ and $d_{BC}$ are for those bonds along the (111)
direction, and the second values are for those in the other three directions.

ABC2      Structure   c/a      a        d_AC            d_BC
AlGaAs2   CA          1.000    5.658    2.450           2.449
          CH          1.000    5.656    2.450           2.449
          CP          0.998    5.660    2.451, 2.449    2.444, 2.451
AlGaP2    CA          1.000    5.457    2.365           2.362
          CH          1.000    5.459    2.366           2.361
          CP          0.998    5.461    2.371, 2.362    2.354, 2.366
InGaSb2   CA          1.009    6.272    2.781           2.669
          CH          0.995    6.298    2.788           2.660
          CP          1.001    6.295    2.788, 2.762    2.650, 2.699
InAlAs2   CA          1.013    5.831    2.597           2.479
          CH          0.992    5.870    2.604           2.469
          CP          1.001    5.861    2.599, 2.578    2.456, 2.512
InGaAs2   CA          1.012    5.834    2.597           2.478
          CH          0.993    5.866    2.605           2.468
          CP          1.002    5.859    2.602, 2.577    2.457, 2.510
InAlP2    CA          1.007    5.658    2.520           2.395
          CH          0.996    5.674    2.527           2.384
          CP          1.001    5.677    2.516, 2.504    2.381, 2.425
InGaP2    CA          1.013    5.640    2.518           2.390
          CH          0.993    5.672    2.526           2.379
          CP          1.003    5.665    2.520, 2.499    2.368, 2.425
AsPGa2    CA          1.001    5.549    2.430           2.379
          CH          0.999    5.552    2.437           2.371
          CP          0.998    5.556    2.450, 2.422    2.354, 2.390
SbAsGa2   CA          1.003    5.880    2.601           2.499
          CH          0.998    5.884    2.613           2.482
          CP          0.995    5.905    2.637, 2.585    2.439, 2.531
SbPGa2    CA          1.008    5.776    2.586           2.435
          CH          0.994    5.787    2.603           2.407
          CP          0.993    5.822    2.635, 2.567    2.344, 2.486
CdHgTe2   CA          1.000    6.471    2.805           2.799
          CH          1.000    6.471    2.806           2.798
          CP          0.999    6.473    2.817, 2.798    2.788, 2.805
HgZnTe2   CA          1.010    6.256    2.783           2.656
          CH          0.995    6.286    2.787           2.650
          CP          1.003    6.277    2.771, 2.770    2.659, 2.679
CdZnTe2   CA          1.016    6.252    2.790           2.657
          CH          0.991    6.302    2.795           2.650
          CP          1.007    6.275    2.787, 2.771    2.647, 2.685

TABLE X. The ratios $\gamma_{AC} = (d_{AC}-\bar{d})/(d^0_{AC}-\bar{d})$ for the ordered alloys calculated from the present TB
model and comparison with experimental results for the disordered alloys.

            CA (ETB)                CH (ETB)                Disordered alloys (experiments)
ABC2        gamma_AC   gamma_BC     gamma_AC   gamma_BC     gamma_AC   gamma_BC
InGaSb2     0.71       0.64         0.80       0.76         0.89 a     0.88 a
InGaAs2     0.71       0.66         0.80       0.78         0.77 a     0.80 a
InGaP2      0.74       0.67         0.83       0.79         0.80 a     0.76 a
AsPGa2      0.59       0.57         0.75       0.75         0.76 a     0.75 a
HgZnTe2     0.81       0.76         0.86       0.84         0.72 b     0.73 b

a Reference 58.
b Reference 2.


and single bonds, respectively. However, the alloy lattice
constants $a$ are quite close to the mean value $\bar{a}$ of the
constituent compounds. A more sensitive measure of the
bimodal distribution, following Boyce and Mikkelsen, is to
look at the ratio $\gamma_{AC} = (d_{AC}-\bar{d})/(d^0_{AC}-\bar{d})$, with $\bar{d}$ being
the average bond length, and similarly, $\gamma_{BC}$. Table X
compares these ratios for the CA and CH structures with
the experimental values for the disordered bulk alloys.
Note that a value of $\gamma=1$ corresponds to the totally relaxed
case where there is no bond stretching, whereas
$\gamma=0$ corresponds to a rigid virtual crystal where all the
atoms are on the zinc-blende crystal sites. In terms of the
valence-force model, this ratio is roughly $\gamma=\alpha/(\alpha+2\beta)$,
where $\alpha$ and $\beta$ [see Eq. (13)] are the average values for the
bond-stretching and angular-restoring-force constants of
the two constituent compounds. The calculated $\gamma$ ratios in
Table X follow this trend (see the $\alpha$ and $\beta$ values in Table
II). However, the values of $\gamma$ show that CH structures
are more relaxed than the CA structures. This result is
consistent with the lower $\Delta E$ values in CH than in CA
shown in Table VI. The ETB values of $\gamma$ for GaInAs2,
GaInP2, and Ga2AsP in the CH structure are very close
to the experimental values for the disordered alloys.
This result, when correlated with the lower excess energy,
may suggest that the disordered alloys tend to favor a local
configuration of the CH structure. However, the calculated
$\gamma$ values for GaInSb2 and HgZnTe2 do not correlate
well with the experimental results. As a matter of
fact, the trend as a function of the $\beta/\alpha$ ratio in the experimental
results is reversed. It would be interesting to see
if these two ordered alloys do have different bond lengths
from the disordered states. While the values of $\gamma$ in this
table range from 0.6 to 0.9 for the lattice-mismatched alloys,
this trend does not hold for the lattice-matched alloys.
For example, Table IX shows that for Hg-Te and
Cd-Te bond lengths in HgCdTe2, both CH and CA
structures nearly retain their respective constituent crystal
values (i.e., $\gamma\approx1$), which was also seen in a recent
experiment on the bulk alloy. Finally, the bond lengths
for the CP structure are characteristically different from
those in CA and CH structures. The single bonds along
the ordering direction [111] tend to be close to the constituent
values, while the triple bonds in the other directions
have less relaxation (with $\gamma$ values around 0.5 or
less).
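To make the definition of the relaxation ratio concrete, the sketch below recomputes $\gamma$ for InGaSb2 in the CA structure from the Table IX bond lengths. Two things are assumptions of this example rather than statements from the paper: the constituent bond lengths are derived from commonly quoted lattice constants of InSb and GaSb (6.479 and 6.096 Å) via $d^0=\sqrt{3}\,a/4$, and $\bar{d}$ is taken as the mean of the two constituent bond lengths.

```python
import math

def zinc_blende_bond_length(a):
    """Nearest-neighbor bond length of a zinc-blende crystal, sqrt(3)*a/4."""
    return math.sqrt(3.0) * a / 4.0

def relaxation_ratio(d_alloy, d0_self, d0_other):
    """gamma = (d - d_bar) / (d0 - d_bar), with d_bar taken here as the
    mean of the two constituent bond lengths (virtual-crystal value)."""
    d_bar = 0.5 * (d0_self + d0_other)
    return (d_alloy - d_bar) / (d0_self - d_bar)

# Assumed constituent lattice constants in angstrom (not from this section).
d0_insb = zinc_blende_bond_length(6.479)
d0_gasb = zinc_blende_bond_length(6.096)

# Table IX, InGaSb2 in the CA structure: d_AC = 2.781, d_BC = 2.669.
gamma_ac = relaxation_ratio(2.781, d0_insb, d0_gasb)
gamma_bc = relaxation_ratio(2.669, d0_gasb, d0_insb)
print(round(gamma_ac, 2), round(gamma_bc, 2))
```

With these inputs the ratios come out close to the Table X entries for this alloy (0.71 and 0.64); the small residual differences reflect the assumed lattice constants.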


TABLE XI. Bulk moduli (in $10^{11}$ dyn/cm$^2$) of ordered alloys in three crystal structures calculated from the present TB model and
the percentage deviations $\Delta B = (B - B_{av})/B_{av}$ from the average values $B_{av}$ of the constituent compounds.

Alloy      B_CH      B_CA      B_CP      B_av      dB_CH (%)   dB_CA (%)   dB_CP (%)
AlGaP2     8.858     8.858     8.854     8.8715     -0.150      -0.15       -0.201
AlGaAs2    7.695     7.693     7.689     7.7085     -0.181      -0.20       -0.257
AlInP2     7.876     7.860     7.774     7.9235     -0.605      -0.08       -1.882
AlInAs2    6.705     6.691     6.661     6.7605     -0.828      -1.03       -1.475
Ga2PAs     8.328     8.291     8.294     8.4165     -1.046      -1.50       -1.457
Ga2PSb     6.584     6.297     6.188     7.4675    -11.836     -15.68      -17.135
Ga2AsSb    6.314     6.198     6.157     6.7410     -6.342      -8.05       -8.662
GaInP2     8.007     8.035     8.878     8.1950     -1.437      -1.95       -3.865
GaInAs2    6.610     6.579     6.508     6.7420     -1.961      -2.42       -3.474
GaInSb2    5.226     5.202     5.156     5.3115     -1.607      -2.07       -2.923
ZnCdTe2    4.6105    4.6035    4.3375    4.6500     -0.85       -1.00       -6.72
ZnHgTe2    4.8898    4.8872    4.6323    4.9245     -0.71       -0.76       -5.93
CdHgTe2    4.4697    4.4721    4.4706    4.4845     -0.33       -0.28       -0.31



C.  Bulk  moduli 

Table XI lists the results for the bulk moduli calculated
from ETB. Also listed are the percentage deviations
from the mean, $\Delta B/B$. Note that all $\Delta B$ values are negative.
The magnitudes of $\Delta B$ are small except for
Ga2AsSb and Ga2PSb, which have large differences in
both $B$ and $d$ between the constituent compounds. Although
the magnitudes of $\Delta B$ get larger for systems with
larger differences in the bond lengths, the dependence
does not seem to be a simple function of the bond-length
difference. The uniformly negative $\Delta B$ values were also
obtained from LDF by Ferreira et al. for the Ga2AsSb
alloys. One reason for the negative values of $\Delta B$, in a
very qualitative argument, is that the bulk
moduli of semiconductors scale inversely as high powers
of the lattice constant and, at the same time, the alloy
lattice constant is approximated well by the mean
value (Vegard's law). This qualitative behavior also
comes out of a simple VFF analysis for disordered alloys.
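The percentage deviations in Table XI and the qualitative argument for their sign can both be illustrated numerically. The sketch below first recomputes two chalcopyrite entries from the tabulated moduli, then evaluates a power-law modulus $B\propto a^{-n}$ at the mean lattice constant. The exponent $n=5$ and the two lattice constants are purely illustrative assumptions; the text only says that $B$ scales as a high inverse power of $a$.

```python
def percent_deviation(b, b_avg):
    """Percentage deviation 100 * (B - B_av) / B_av."""
    return 100.0 * (b - b_avg) / b_avg

# Chalcopyrite entries of Table XI (units of 1e11 dyn/cm^2).
dev_algap2_ch = percent_deviation(8.858, 8.8715)
dev_ga2psb_ch = percent_deviation(6.584, 7.4675)

def modulus_power_law(a, n=5.0, k=1.0):
    """Assumed scaling B = k * a**(-n); n = 5 is only an illustrative value."""
    return k * a ** (-n)

# Hypothetical constituent lattice constants in angstrom.
a_ac, a_bc = 5.65, 6.10
# Alloy modulus evaluated at the Vegard (mean) lattice constant ...
b_alloy = modulus_power_law(0.5 * (a_ac + a_bc))
# ... compared with the mean of the two constituent moduli.
b_mean = 0.5 * (modulus_power_law(a_ac) + modulus_power_law(a_bc))
dev_vegard = percent_deviation(b_alloy, b_mean)
print(round(dev_algap2_ch, 3), round(dev_ga2psb_ch, 3), dev_vegard < 0)
```

Because $a^{-n}$ is a convex function of $a$, its value at the mean lattice constant always lies below the mean of its values at the two end points, so the deviation is negative for any positive $n$ and grows with the lattice mismatch.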

V.  SUMMARY  AND  CONCLUSION 

This work was motivated by our interest in understanding
the mechanism of ordering for the semiconductor
alloys grown by MBE and MOCVD. We have applied
an empirical tight-binding (ETB) model to systematically
interpolate the alloy total energies from those
of the constituent compounds. Since the strain energy
makes a dominant contribution to the excess energy $\Delta E$,
particular attention has been given to the elastic properties,
in addition to the lattice constants and cohesive energies.
Our calculated bulk excess energies $\Delta E$ are positive for
all alloys composed of lattice-mismatched compounds,
and nearly zero for the lattice-matched systems. Based
on these results, we conclude that all the ordered semiconductor
alloys found experimentally are not in their
thermodynamically stable states at the experimental growth
temperatures. The same conclusion can also be drawn
from the most recent LDF calculations listed
in Table VI. We note that several earlier theories
that concluded a stable ordering for these alloys have all
been revised (see also comments in Ref. 71).

Our  calculation  also  generates  detailed  information 
about  the  structures  and  bulk  moduli  of  these  alloys. 
These  results  should  be  checked  experimentally. 

To further explore the stability of these alloys, we examined
the energetics for spontaneous ordering when the
grown materials are constrained to match a substrate lattice.
Two kinds of epitaxial energies are calculated: one,
denoted $\Delta E_{ep}$, corresponds to the situation in which the
constituent compounds are allowed to relax fully along
the growth direction; the other, the hard-model energy,
does not allow c/a relaxation. Although the sign
of $\Delta E_{ep}$ can be negative, the magnitudes are too small to
account for the observed spontaneous ordering. However,
the sign and magnitudes of the hard-model energy for a number of
lattice-mismatched alloys are found to be comparable to
the energies needed for ordering at the growth temperatures.
Whether or not this is a plausible mechanism
deserves a more detailed investigation.

Even if the hard-model energy turns out to be a possible mechanism,
based on the considerations presented in this paper we
are still left without an explanation for the ordering in
the lattice-matched alloys such as GaAlAs2. We do not
believe that any refined bulk calculation will produce an
excess energy with large enough magnitude to account
for the observed ordering in GaAlAs2. On the other
hand, epitaxial growth is very surface sensitive. Because
of changes in the bonding character at surfaces, e.g., dangling
bonds, charge transfer, and reconstruction, the surface
structural energies behave quite differently from
those in the bulk, and do not extrapolate from the bulk
energies. New mechanisms for spontaneous ordering
may emerge from surface energetic considerations. Some
hopeful thoughts along this line have already been suggested.

APPENDIX: EXCESS ENERGY AND ORDERING

The excess energy $\Delta E$ and the two epitaxial energies (relaxed and hard-model) considered
in the text for the ordered alloys have to be considered along with the free energies of other
possible phases, including the disordered alloy, in order to determine if the ordered phase is
stable at a given temperature. A schematic mixing free-energy curve $\Delta F$ for a disordered
pseudobinary alloy $A_{1-x}B_xC$ as a function of alloy composition $x$ at several different
temperatures is shown in Fig. 2. Figure 2(a) is for the case $\Delta E > 0$, and Fig. 2(b) for
$\Delta E < 0$. (These curves are similar to Fig. 1 in Ref. 57.) If $\Delta E > 0$, then the
stable phase is either the segregated phase or a uniform disordered solution, depending on
whether the temperature $T$ is lower or greater than the critical temperature $T_c$ for phase
segregation or forming miscibility gaps. However, if $\Delta E < 0$, then the ordered phase is
stable at a temperature $T$ lower than the critical temperature for the order-disorder
transition.

FIG. 2. Schematic plots of the mixing free energy of a disordered alloy as a function of alloy
composition $x$ between AC and BC (dotted lines) for (a) $\Delta E > 0$ and (b) $\Delta E < 0$.
The dashed line represents $\Delta E$ for an ordered alloy ABC2.

A crude estimate of the magnitude of $\Delta E$ in Fig. 2(b) required for ordering at a given
temperature $T$ can be made by assuming that the disordered phase is a random alloy. Then in
the present units of energy (for an ABC2 molecule), the magnitude of the ordering energy,
defined as $\Delta E_o = |\Delta E - \Delta E_D|$, must be greater than $2kT\ln 2$, where
$\Delta E_D$ is the mixing energy for the disordered phase at $x = 1/2$ and $T = 0$, also shown
in Fig. 2(b). At a typical epitaxial growth temperature of 600 °C (see Table I), the minimum
ordering energy is estimated to be $\Delta E_o \approx 100$ meV. Taking
$\Delta E_o = |\Delta E|/2$, one finds a critical value of $\Delta E = -200$ meV. However, if a
strict pair-potential nearest-neighbor model is used, $\Delta E_o = |\Delta E|/4$ for both the
CuAu I and chalcopyrite structures, and the required $\Delta E$ value becomes -400 meV in this
estimate.
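The arithmetic behind this estimate can be checked with a short sketch. It assumes the random-alloy threshold is read as 2kT ln 2 per ABC2 unit (two mixed sites, each contributing k ln 2 of entropy at x = 1/2); the function names and the `ordering_fraction` parameter are hypothetical and serve only this illustration.

```python
import math

K_B_MEV_PER_K = 8.617333e-2  # Boltzmann constant in meV/K


def min_ordering_energy_mev(temp_celsius):
    """Random-alloy estimate 2 k T ln 2 (meV per ABC2 unit) of the minimum ordering energy."""
    temp_kelvin = temp_celsius + 273.15
    return 2.0 * K_B_MEV_PER_K * temp_kelvin * math.log(2.0)


def critical_excess_energy_mev(temp_celsius, ordering_fraction):
    """Excess energy needed when the ordering energy is `ordering_fraction` times |dE|."""
    return -min_ordering_energy_mev(temp_celsius) / ordering_fraction


threshold = min_ordering_energy_mev(600.0)
print(round(threshold, 1))                       # close to the 100 meV quoted in the text
print(critical_excess_energy_mev(600.0, 0.5))    # ordering energy = |dE|/2
print(critical_excess_energy_mev(600.0, 0.25))   # ordering energy = |dE|/4
```

Without rounding the threshold to 100 meV, the two critical values come out slightly larger in magnitude than the -200 and -400 meV quoted above.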


Finally, consider the implication of the different excess energies for ordering. Let us take
Ga2AsSb in the chalcopyrite structure as an example, for which the three excess energies from
the TB calculation are 70, -28, and -205 meV for the bulk $\Delta E$, the relaxed epitaxial
energy, and the hard-model epitaxial energy, respectively (see Tables VI, VII, and VIII). The
positive value of $\Delta E$ corresponds to the case of Fig. 2(a), which shows that the ordered
bulk phase is not stable at any temperature. Although the negative relaxed epitaxial energy of
-28 meV corresponds to the case in Fig. 2(b), the corresponding critical temperature $T_c$ is
very low. This implies that the simple constraint of matching the lateral lattice constants of
the grown material to the substrate lattice is not enough to produce a metastable ordered
alloy at the experimental growth temperature of 600 °C. However, the significant negative
hard-model energy of -205 meV, resulting from a further constraint in which the c-axis
relaxation is not allowed, may raise the $T_c$ in Fig. 2(b) into the range of the growth
temperature and make the metastable ordered phase a possibility.
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INTRODUCTION 


1.  GOALS 

The goals of this paper are to review the current state of knowledge of the elastic constants
of elemental, compound, and pseudobinary alloy semiconductors. To accomplish this objective,
we will review:

•  Experimental methods currently used to measure elastic constants

•  Experimental results

•  Binding and elastic constant theory

•  Related mechanical properties

Heavy emphasis is placed on comparisons between theory and experiment, and the accuracy of
approximations currently in vogue. The theories discussed range from first-principles methods,
requiring heavy computations, to parameterized physical models. The intent is to identify a
logical path between these extremes, and thereby provide insight into the connection between
atom potentials and semiconductor mechanical properties. In the course of this presentation,
there are a number of instances where the need for improvements in the theory, or for
additional experimental results, is encountered. We have tried to highlight these situations
and suggest possible remedies.

2.  DEFINITION AND CALCULATION OF ELASTIC CONSTANTS

The theory of elasticity of solids has been well formulated in many treatises (Love, 1944;
Landau and Lifshitz, 1959), so we need not review the different formalisms and conventions.
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However,  as  we  do  wish  to  present  a  coherent  account  of  the  essence  of  the  theory,  it  is  necessary 
to  define  terms  and  describe  general  calculations  to  be  used  later. 

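In linear elasticity theory, deformation is assumed to be infinitesimal. The relative
displacement vector $x'$ between two points in a deformed solid is related to the corresponding
vector $x$ in the undeformed solid by the following equation, in component form, through a
9-component strain tensor $\varepsilon$:

$$x'_\alpha = x_\alpha + \sum_\beta \varepsilon_{\alpha\beta}\, x_\beta . \qquad (1)$$

The change in the internal energy associated with $\varepsilon$ is also small and will be
denoted $UV$, where $V$ is the equilibrium volume of a solid. Under the condition that the
entropy and the electrostatic displacement field are constant, $U$ is only a function of
$\varepsilon$, and is quadratic in $\varepsilon$:

$$U = \frac{1}{2} \sum_{\alpha\beta\mu\nu} C_{\alpha\beta\mu\nu}\, \varepsilon_{\alpha\beta}\, \varepsilon_{\mu\nu} , \qquad (2)$$

where $C_{\alpha\beta\mu\nu}$ are the elastic stiffness coefficients, which are characteristic
properties of the solid. From this definition, $C_{\mu\nu\alpha\beta} = C_{\alpha\beta\mu\nu}$
is required for $U$ to be an analytical function of $\varepsilon$. Moreover,
$C_{\alpha\beta\mu\nu} = C_{\beta\alpha\mu\nu} = C_{\alpha\beta\nu\mu}$ is also required to
ensure that $U$ is zero under any infinitesimal rigid rotation. In light of these properties,
the energy can be expressed in terms of a symmetrical strain tensor $\eta$ as

$$U = \frac{1}{2} \sum_{\alpha\beta\mu\nu} C_{\alpha\beta\mu\nu}\, \eta_{\alpha\beta}\, \eta_{\mu\nu} , \qquad (3)$$

where $\eta_{\alpha\beta}$ is defined as

$$\eta_{\alpha\beta} = \frac{1}{2} \left( \varepsilon_{\alpha\beta} + \varepsilon_{\beta\alpha} \right) . \qquad (4)$$

The strain tensor $\eta$ is a thermodynamic parameter with stress tensor $\sigma$ as its
conjugate variable (Brugger, 1964). The components of $\sigma$ are given by

$$\sigma_{\alpha\beta} = \sum_{\mu\nu} C_{\alpha\beta\mu\nu}\, \eta_{\mu\nu} . \qquad (5)$$

It is clear that $\sigma$ is also a symmetrical tensor.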

The most frequently used notation is the engineering convention, in which the strain tensor
$e$ is related to $\eta$ of Eq. (4) by $e_{\alpha\alpha} = \eta_{\alpha\alpha}$ for the diagonal
components, but $e_{\alpha\beta} = 2\eta_{\alpha\beta}$ for $\alpha \neq \beta$. Furthermore,
since $e$ has at most six independent components, it is treated as a 6-component vector, with
the vector components 1 to 6 corresponding respectively to the tensor components xx, yy, zz,
yz, xz, and xy. In terms of $e$ the strain energy density is written as

$$U = \frac{1}{2} \sum_{ij} C_{ij}\, e_i\, e_j , \qquad (6)$$

where $C_{ij}$ can be identified as $C_{\alpha\beta\mu\nu}$ with $i = \alpha\beta$ and
$j = \mu\nu$. Because $C_{ij}$ is symmetrical, it has at most 21 independent components for any
crystal. Crystal symmetries reduce this number further (Love, 1944; Ashcroft and Mermin, 1976).
For a cubic lattice, to which class the zincblende and diamond semiconductors belong, there are
only three independent components, namely $C_{11}$, $C_{12}$, and $C_{44}$.
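The engineering convention can be made concrete with a minimal sketch. It assumes the component order xx, yy, zz, yz, xz, xy stated above and the usual cubic form of the 6 x 6 stiffness matrix in the cube-axis frame. The function names are hypothetical, and the numerical constants are only illustrative, roughly silicon-like values in GPa that are not taken from this text.

```python
def cubic_stiffness(c11, c12, c44):
    """6x6 stiffness matrix C_ij of a cubic crystal in the engineering convention."""
    c = [[0.0] * 6 for _ in range(6)]
    for i in range(3):
        for j in range(3):
            c[i][j] = c11 if i == j else c12
        c[i + 3][i + 3] = c44
    return c


def engineering_strain(eta):
    """Map a symmetric 3x3 strain tensor eta to the 6-vector e (xx, yy, zz, yz, xz, xy)."""
    return [eta[0][0], eta[1][1], eta[2][2],
            2.0 * eta[1][2], 2.0 * eta[0][2], 2.0 * eta[0][1]]


def strain_energy_density(c, e):
    """U = (1/2) sum_ij C_ij e_i e_j, the quadratic form of Eq. (6)."""
    return 0.5 * sum(c[i][j] * e[i] * e[j] for i in range(6) for j in range(6))


# Illustrative inputs only (GPa); a small xy shear with eta_xy = 5e-4.
stiffness = cubic_stiffness(165.7, 63.9, 79.6)
shear_eta = [[0.0, 5e-4, 0.0], [5e-4, 0.0, 0.0], [0.0, 0.0, 0.0]]
shear_e = engineering_strain(shear_eta)
shear_energy = strain_energy_density(stiffness, shear_e)
print(shear_e, shear_energy)
```

The factor of 2 on the off-diagonal components is what lets the shear energy take the simple form C44 e6^2 / 2 without extra factors.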

The three independent elastic constants of the diamond and zincblende (zb) semiconductors can
be calculated by considering the following three strains.

(i) Under a uniform expansion, which changes a displacement $x$ into $x' = (1 + e)\,x$, then
$e_1 = e_2 = e_3 = e$ and the other strain components are zero; the elastic energy density is
then given by

$$U = 3\,(C_{11} + 2C_{12})\, e^2 / 2 . \qquad (7)$$

$U$ can also be expressed in terms of the adiabatic bulk modulus $B$ defined by
$\delta P = -B\,(\delta V / V)$, where $V$ is the crystal volume, $\delta V$ is its change, and
$\delta P$ is the corresponding pressure change.

The result is $U = B\,(\delta V / V)^2 / 2 = 9Be^2/2$, because the dilatation is
$\delta V / V = 3e$ in the present case. This establishes the relationship

$$B = (C_{11} + 2C_{12}) / 3 . \qquad (8)$$
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(ii)  The  next  case  to  consider  is  a  tetragonal  shear  strain  e  which  changes  a  displacement 
according  to 

$$(x,\, y,\, z) \rightarrow (x + ex,\; y - ey,\; z) . \qquad (9)$$

The only nonzero strain components are $e_1 = -e_2 = e$. Then $U$ becomes

$$U = (C_{11} - C_{12})\, e^2 . \qquad (10)$$

(iii) To calculate $C_{44}$ we consider a shear strain $e$ which changes a displacement
according to

$$(x,\, y,\, z) \rightarrow (x + ey/2,\; y + ex/2,\; z) . \qquad (11)$$

This strain contains $e_6$ as the only nonzero component. Because $C_{44}$ equals $C_{66}$, the
energy density is simply

$$U = C_{44}\, e^2 / 2 . \qquad (12)$$
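As a numerical cross-check of the three test strains, the sketch below evaluates the cubic strain energy density directly from a 3 x 3 distortion tensor and compares it with the closed forms 3(C11 + 2C12)e^2/2, (C11 - C12)e^2, and C44 e^2/2. The helper name and the roughly silicon-like constants (GPa) are assumptions made for illustration only.

```python
def cubic_energy_density(c11, c12, c44, eps):
    """Strain energy density of a cubic crystal for a 3x3 distortion tensor eps."""
    # Symmetrize the distortion and convert to engineering strains.
    eta = [[0.5 * (eps[a][b] + eps[b][a]) for b in range(3)] for a in range(3)]
    e1, e2, e3 = eta[0][0], eta[1][1], eta[2][2]
    e4, e5, e6 = 2.0 * eta[1][2], 2.0 * eta[0][2], 2.0 * eta[0][1]
    return (0.5 * c11 * (e1 ** 2 + e2 ** 2 + e3 ** 2)
            + c12 * (e1 * e2 + e2 * e3 + e1 * e3)
            + 0.5 * c44 * (e4 ** 2 + e5 ** 2 + e6 ** 2))


c11, c12, c44 = 165.7, 63.9, 79.6   # GPa, illustrative values only
e = 1.0e-3

uniform = [[e, 0.0, 0.0], [0.0, e, 0.0], [0.0, 0.0, e]]           # case (i)
tetragonal = [[e, 0.0, 0.0], [0.0, -e, 0.0], [0.0, 0.0, 0.0]]     # case (ii)
shear = [[0.0, e / 2, 0.0], [e / 2, 0.0, 0.0], [0.0, 0.0, 0.0]]   # case (iii)

u_uniform = cubic_energy_density(c11, c12, c44, uniform)
u_tetragonal = cubic_energy_density(c11, c12, c44, tetragonal)
u_shear = cubic_energy_density(c11, c12, c44, shear)

bulk_modulus = 2.0 * u_uniform / (9.0 * e ** 2)   # from U = 9 B e^2 / 2
print(bulk_modulus, (c11 + 2.0 * c12) / 3.0)
```

In a first-principles calculation the same three distortions would be applied to the lattice and the total-energy differences fitted to these quadratic forms; here the energy function simply stands in for that calculation.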

Although the macroscopic crystal distortion of the Bravais lattice caused by this strain is
described by Eq. (11), microscopically there is a relative displacement $u = (0, 0, u)$, the
so-called Kleinman (1962) internal displacement, between two successive atomic planes
perpendicular to the z axis. In other words, the relative displacements between the atoms on
the same fcc sublattice are governed by Eq. (11), but there is an additional induced relative
displacement $u$ between the two sublattices. The directions of the displacements of atoms in a
tetrahedral cell are shown in Figure 1. In calculations one can use an arbitrary infinitesimal
pair of $e$ and $u$ to obtain the coefficients in the following quadratic expansion of the
strain energy density:

$$U = \Phi u^2 / 2 + D\,e\,u + C_{44}^{(0)} e^2 / 2 , \qquad (13)$$

where the force function $\Phi$ is related to the transverse optical phonon frequency $\omega$
at $\Gamma$ (the center of the Brillouin zone) by $\Phi = \mu\omega^2$, with $\mu$ being the
reduced mass. In Eq. (13), $D$ is a constant, and $C_{44}^{(0)}$ would be the shear stiffness
coefficient if the internal displacement were not allowed. Kleinman (1962) defined an internal
displacement parameter $\zeta$, which is related to the equilibrium value of $u$ by
$u = \zeta a e / 4$ for a fixed $e$, where $a$ is the lattice constant. Taking the first
derivative of $U$ with respect to $u$ in Eq. (13) and setting it equal to zero, one finds that
the $\zeta$ value is given by

$$\zeta = -4D / (a\Phi) . \qquad (14)$$

Finally, the sought-after $C_{44}$ is given by

$$C_{44} = C_{44}^{(0)} - \zeta^2 a^2 \Phi / 16 . \qquad (15)$$

Thus the internal displacement is an essential part of $C_{44}$. These procedures will be used
in the theoretical calculations to be presented in subsequent sections.
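The elimination of the internal displacement can be illustrated with a small sketch. It assumes the quadratic form U = Phi u^2/2 + D e u + C44_unrelaxed e^2/2 and the relation u = zeta a e/4 for the equilibrium displacement. The coefficient values are arbitrary numbers in consistent units, not fitted to any material, and the function names are hypothetical.

```python
def relaxed_c44(phi, d, c44_unrelaxed, a):
    """Return (zeta, C44) obtained by minimizing the quadratic energy over u."""
    zeta = -4.0 * d / (a * phi)
    c44 = c44_unrelaxed - zeta ** 2 * a ** 2 * phi / 16.0
    return zeta, c44


def energy(phi, d, c44_unrelaxed, e, u):
    """Strain energy density for shear strain e and internal displacement u."""
    return 0.5 * phi * u ** 2 + d * e * u + 0.5 * c44_unrelaxed * e ** 2


# Arbitrary illustrative coefficients (consistent units assumed).
phi, d, c0, a = 4.0, -1.5, 1.2, 2.0
e = 1.0e-3

zeta, c44 = relaxed_c44(phi, d, c0, a)
u_eq = zeta * a * e / 4.0                 # equilibrium internal displacement
u_min = energy(phi, d, c0, e, u_eq)       # energy at the relaxed geometry
print(zeta, c44, u_min, 0.5 * c44 * e ** 2)
```

The relaxed energy equals C44 e^2/2 with C44 = C44_unrelaxed - D^2/Phi, so the internal relaxation always lowers the shear constant relative to its unrelaxed value.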


3.  ELASTIC  CONSTANTS  AND  SONIC  WAVE  PROPAGATION 

The purpose of this section is to give the reader sufficient information to measure ultrasonic
wave velocities and to deduce from them the elastic coefficients of materials of interest to
the electronics industry. We start from the definition of strain and develop equations that
relate the elastic coefficients with wave propagation velocities in various crystallographic
directions. The cases taken as examples, cubic and isotropic, were chosen because of their
prevalent role in electronic materials.

A number of techniques used to measure wave propagation velocities in materials are discussed.
The researcher's decision on technique depends upon a number of factors, such as accuracy
needed, size and crystallographic orientation of available samples, equipment, etc. Sample
preparation is significant since a carelessly prepared sample yields useless velocity data.
Because the equipment needs vary widely among the techniques that can be used, references
containing information on each technique are given. Key papers chosen for citation were picked
for their readability and their direct application to the measurement technique. Primary
emphasis has been given to techniques that are generally accepted, and that give results that
are understood. Throughout this section a general theory of elasticity due to Murnaghan (1951)
is used.

a.  Elastic to Wave Relationship


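Consider, as before, a point in a lossless elastic medium at rest, whose position vector
$x = (x_\alpha)$, $\alpha = 1, 2, 3$, gives the Cartesian coordinates of the point in the
medium. In the presence of a stress wave disturbance, the point moves to a new location,
$x' = (x'_\alpha)$, at time $t$. The Lagrangian strains, $\eta$, defined consistently with
Eqs. (1) and (4) but with higher order terms retained (Landau and Lifshitz, 1986; Murnaghan,
1951) are as follows:

$$\eta_{\mu\nu} = \frac{1}{2} \left( \frac{\partial x'_\alpha}{\partial x_\mu} \frac{\partial x'_\alpha}{\partial x_\nu} - \delta_{\mu\nu} \right) , \qquad (16)$$

where $\delta_{\mu\nu}$ is the Kronecker delta. Einstein summation convention over repeated
indices is used in this section.

By considering the internal energy per unit mass of the medium, $E(x_\alpha, \eta_{\alpha\beta}, S)$,
where $S$ is the entropy, we obtain a relationship between wave propagation and the internal
energy of the medium. If we restrict this discussion to a medium that is initially unstressed,
then the internal energy per unit volume of the medium is
$\rho_0 E(x_\alpha, \eta_{\alpha\beta}, S)$, where $\rho_0$ is the mass density of the medium in
the unstrained state. By expanding this in a power series of the Lagrangian strains, we obtain

$$\rho_0 E(x_\alpha, \eta_{\alpha\beta}, S) = \rho_0 E(x_\alpha, 0, S) + \frac{1}{2} C_{\mu\nu\alpha\beta}\, \eta_{\mu\nu} \eta_{\alpha\beta} + \frac{1}{6} C_{\mu\nu\alpha\beta\gamma\delta}\, \eta_{\mu\nu} \eta_{\alpha\beta} \eta_{\gamma\delta} + \cdots \qquad (17)$$

The coefficients $C_{\mu\nu\alpha\beta}$ and $C_{\mu\nu\alpha\beta\gamma\delta}$ are the
second-order and third-order elastic coefficients (adiabatic) as defined by Brugger (1964):

$$C_{\mu\nu\alpha\beta} = \rho_0 \left( \frac{\partial^2 E}{\partial \eta_{\mu\nu}\, \partial \eta_{\alpha\beta}} \right)_{S,\, \eta = 0} , \qquad (18)$$

$$C_{\mu\nu\alpha\beta\gamma\delta} = \rho_0 \left( \frac{\partial^3 E}{\partial \eta_{\mu\nu}\, \partial \eta_{\alpha\beta}\, \partial \eta_{\gamma\delta}} \right)_{S,\, \eta = 0} . \qquad (19)$$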


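The wave equation can be derived (Goldstein, 1965, p. 347) from Lagrange's equation

$$\frac{d}{dt} \frac{\partial L}{\partial \dot{x}'_\mu} + \frac{\partial}{\partial x_\nu} \frac{\partial L}{\partial (\partial x'_\mu / \partial x_\nu)} - \frac{\partial L}{\partial x'_\mu} = 0 , \qquad (20)$$

where $\dot{x}'_\mu = dx'_\mu / dt$ and $L$ is given by

$$L = \frac{1}{2} \rho_0\, \dot{x}'_\mu \dot{x}'_\mu - \rho_0 E(x_\alpha, \eta_{\alpha\beta}, S) . \qquad (21)$$

Defining the particle displacement vector, $u$, as before,

$$u_\mu(x, t) = x'_\mu - x_\mu , \qquad (22)$$

and differentiating it gives

$$\frac{\partial u_\mu}{\partial x_\nu} = \frac{\partial x'_\mu}{\partial x_\nu} - \delta_{\mu\nu} . \qquad (23)$$

Combining Eqs. (22), (23), and (16) gives the strain tensor in terms of the particle
displacements,

$$\eta_{\mu\nu} = \frac{1}{2} \left( \frac{\partial u_\mu}{\partial x_\nu} + \frac{\partial u_\nu}{\partial x_\mu} + \frac{\partial u_\alpha}{\partial x_\mu} \frac{\partial u_\alpha}{\partial x_\nu} \right) . \qquad (24)$$

Using Eqs. (21) to (24) in Eq. (20), and retaining first order terms in
$\partial u_\mu / \partial x_\nu$, gives

$$\rho_0\, \ddot{u}_\mu = C_{\mu\nu\alpha\beta}\, \frac{\partial^2 u_\alpha}{\partial x_\nu\, \partial x_\beta} . \qquad (25)$$

We assume a plane wave solution to Eq. (25) of the form

$$u_\mu = e_\mu\, u \cos(k_\nu x_\nu - \omega t) , \qquad (26)$$

where

    $e_\mu$ = unit displacement vector in the $\mu$th direction
    $u = (u_\mu)$ = particle displacement vector
    $k = (k_\nu)$ = wave propagation vector
    $\omega$ = angular frequency.

Substituting Eq. (26) into Eq. (25), we obtain the eigenvalue-eigenvector equation

$$\left| C_{\mu\nu\alpha\beta}\, \kappa_\nu \kappa_\beta - \rho_0 v^2 \delta_{\mu\alpha} \right| = 0 , \qquad (27)$$

where

    $(\kappa_\mu)$ = the set of direction cosines of $\kappa$
    $v = \omega / |k|$ is the phase speed
    $\kappa = k / |k|$ indicates the direction of wave propagation in the medium.

For any given wave propagation direction, there are three eigenvalue solutions to Eq. (27);
these correspond to one quasilongitudinal and two quasitransverse polarization modes.
Relationships have also been derived for wave propagation in the presence of residual stresses
and for retention of the higher-order elastic coefficients (Cantrell, 1982; Breazeale and Ford,
1965).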
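A minimal sketch of the eigenvalue problem for a cubic crystal is given below. It assumes the linearized form in which the matrix with elements C_{mu nu alpha beta} kappa_nu kappa_beta (often called the Christoffel matrix) has eigenvalues rho0 v^2. Rather than calling a general eigen-solver, it checks a proposed pure-mode polarization and returns the corresponding phase speed. All function names are hypothetical, and the material constants are roughly silicon-like values used only as illustrative inputs.

```python
import math


def cubic_tensor(c11, c12, c44):
    """Fourth-rank stiffness C[m][n][a][b] of a cubic crystal in the cube-axis frame."""
    c = [[[[0.0] * 3 for _ in range(3)] for _ in range(3)] for _ in range(3)]
    for m in range(3):
        for n in range(3):
            for a in range(3):
                for b in range(3):
                    if m == n == a == b:
                        c[m][n][a][b] = c11
                    elif m == n and a == b:
                        c[m][n][a][b] = c12
                    elif (m == a and n == b) or (m == b and n == a):
                        c[m][n][a][b] = c44
    return c


def christoffel(c, kappa):
    """Matrix G[m][a] = sum over n, b of C[m][n][a][b] * kappa[n] * kappa[b]."""
    return [[sum(c[m][n][a][b] * kappa[n] * kappa[b]
                 for n in range(3) for b in range(3))
             for a in range(3)] for m in range(3)]


def mode_speed(gamma, polarization, rho0):
    """Phase speed for a polarization that must be an eigenvector of gamma."""
    gu = [sum(gamma[m][a] * polarization[a] for a in range(3)) for m in range(3)]
    norm2 = sum(p * p for p in polarization)
    rho_v2 = sum(g * p for g, p in zip(gu, polarization)) / norm2
    residual = max(abs(g - rho_v2 * p) for g, p in zip(gu, polarization))
    if residual > 1e-9 * abs(rho_v2):
        raise ValueError("polarization is not an eigenvector for this direction")
    return math.sqrt(rho_v2 / rho0)


# Illustrative inputs only (Pa, kg/m^3); propagation along [110].
C11, C12, C44, RHO0 = 165.7e9, 63.9e9, 79.6e9, 2329.0
s = 1.0 / math.sqrt(2.0)
gamma_110 = christoffel(cubic_tensor(C11, C12, C44), [s, s, 0.0])
v_long = mode_speed(gamma_110, [1.0, 1.0, 0.0], RHO0)
v_t1 = mode_speed(gamma_110, [1.0, -1.0, 0.0], RHO0)
v_t2 = mode_speed(gamma_110, [0.0, 0.0, 1.0], RHO0)
print(v_long, v_t1, v_t2)
```

For a general direction the three polarizations are not known in advance, and a 3 x 3 symmetric eigen-solver would replace `mode_speed`.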

Because both the stress and strain tensors are symmetric, one can reduce the number of
independent second-order elastic coefficients to 21 (Landau and Lifshitz, 1986, p. 32). It is
also convenient to use the Voigt (1928) contraction of the indices as in Eq. (6), which is used
in the remainder of this section. The number of independent elastic coefficients varies with
crystal structure, and is listed below.

Crystal Class                          Number of Independent
                                       Elastic Coefficients

Triclinic                              21
Monoclinic                             13
Orthorhombic                            9
Tetragonal (C4, S4, C4h)                7
Tetragonal (C4v, D2d, D4, D4h)          6
Rhombohedral (C3, S6)                   7
Rhombohedral (C3v, D3, D3d)             6
Hexagonal                               5
Cubic                                   3
(Isotropic)                             2


Green (1973) and others have solved Eq. (27) for isotropic materials and for cubic crystals in
the [100], [110], and [111] directions. Using the symmetry properties of the crystals, one
obtains the results presented in subsection b for cubic crystals, and in subsection c for
isotropic solids.


b.  Cubic Crystals

For plane waves propagating along the [100] direction:

$$(\kappa_1 = 1,\ \kappa_2 = 0,\ \kappa_3 = 0) ;$$

as longitudinal (compressional) waves, pure mode (particle displacements: $u_1$ in direction of
propagation, $u_2$ and $u_3 = 0$)

$$v_1 = \sqrt{C_{11} / \rho_0} , \qquad (28)$$

or as transverse (shear) waves, pure mode (particle displacements: $u_1 = 0$, $u_2$ and $u_3$
are perpendicular to the direction of propagation)

$$v_2 = v_3 = \sqrt{C_{44} / \rho_0} . \qquad (29)$$

For plane waves propagating along the [110] direction:

$$(\kappa_1 = \kappa_2 = 1/\sqrt{2},\ \kappa_3 = 0) ;$$

as longitudinal (compressional) waves, pure mode (particle displacements: $u_1 = u_2$,
$u_3 = 0$)

$$v_1 = \sqrt{(C_{11} + C_{12} + 2C_{44}) / 2\rho_0} , \qquad (30)$$

as transverse (shear) waves, pure mode ($u_1 = -u_2$, $u_3 = 0$)

$$v_2 = \sqrt{(C_{11} - C_{12}) / 2\rho_0} , \qquad (31)$$

or as transverse (shear) waves, pure mode ($u_1 = u_2 = 0$, $u_3 \neq 0$)

$$v_3 = \sqrt{C_{44} / \rho_0} . \qquad (32)$$

For plane waves propagating along the [111] direction:

$$(\kappa_1 = 1/\sqrt{3},\ \kappa_2 = 1/\sqrt{3},\ \kappa_3 = 1/\sqrt{3}) ;$$

as longitudinal (compressional) waves, pure mode ($u_1 = u_2 = u_3$)

$$v_1 = \sqrt{(C_{11} + 2C_{12} + 4C_{44}) / 3\rho_0} , \qquad (34)$$

or as transverse (shear) waves, pure mode ($u_1\kappa_1 + u_2\kappa_2 + u_3\kappa_3 = 0$, or
particle displacement perpendicular to wave propagation)

$$v_2 = v_3 = \sqrt{(C_{11} - C_{12} + C_{44}) / 3\rho_0} . \qquad (35)$$
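The pure-mode results for cubic crystals can be collected in one small function. This is a sketch under the assumption that the standard closed forms apply (rho0 v^2 equal to C11, C44, (C11 + C12 + 2C44)/2, (C11 - C12)/2, (C11 + 2C12 + 4C44)/3, and (C11 - C12 + C44)/3 for the respective modes). The function name, dictionary keys, and the roughly silicon-like inputs are illustrative choices, not values taken from this text.

```python
import math


def cubic_pure_mode_speeds(c11, c12, c44, rho0):
    """Pure-mode phase speeds (m/s) for [100], [110], [111]; inputs in Pa and kg/m^3."""
    def speed(modulus):
        return math.sqrt(modulus / rho0)

    return {
        "[100]": {"L": speed(c11), "T": speed(c44)},
        "[110]": {
            "L": speed((c11 + c12 + 2.0 * c44) / 2.0),
            "T[1-10]": speed((c11 - c12) / 2.0),   # polarized along [1-10]
            "T[001]": speed(c44),                  # polarized along [001]
        },
        "[111]": {
            "L": speed((c11 + 2.0 * c12 + 4.0 * c44) / 3.0),
            "T": speed((c11 - c12 + c44) / 3.0),
        },
    }


speeds = cubic_pure_mode_speeds(165.7e9, 63.9e9, 79.6e9, 2329.0)
for direction, modes in speeds.items():
    print(direction, {name: round(v) for name, v in modes.items()})
```

With these inputs the longitudinal speed along [100] comes out a little above 8.4 km/s, and the longitudinal speed increases from [100] to [110] to [111], as expected when 2C44 exceeds C11 - C12.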


c.  Isotropic Solids

For the case of isotropy, all directions are equivalent, so we are left with two independent
constants (Lamé constants). The elastic coefficients can be expressed in terms of the Lamé
constants $\lambda$ and $\mu$ (Green, 1973) as follows:

$$\begin{aligned}
C_{11} &= C_{22} = C_{33} = \lambda + 2\mu \\
C_{12} &= C_{13} = C_{23} = \lambda \\
C_{21} &= C_{31} = C_{32} = \lambda \\
C_{44} &= C_{55} = C_{66} = \mu
\end{aligned} \qquad (36)$$

For plane waves propagating in any direction (e.g., the x-direction):

$$(\kappa_1 = 1,\ \kappa_2 = 0,\ \kappa_3 = 0) .$$

With longitudinal (compressional) waves, pure mode (particle displacements: $u_1$ in direction
of propagation, $u_2$ and $u_3 = 0$)

$$v_1 = \sqrt{(\lambda + 2\mu) / \rho_0} . \qquad (37)$$

With transverse (shear) waves, pure mode (particle displacements: $u_1 = 0$, and $u_2$ and
$u_3$ perpendicular to the direction of propagation)

$$v_2 = v_3 = \sqrt{\mu / \rho_0} . \qquad (38)$$
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II.  MEASUREMENT  METHODS 


4.  VELOCITY  MEASUREMENTS 

To measure the second-order elastic coefficients (SOEC), one can determine the sound velocity
and the density of the sample, and calculate the combination of SOEC that the direction of
propagation requires. For cubic systems, sound velocities measured in the pure mode directions
equivalent to [100], [110], and [111] make it possible to determine the three independent
elastic constants, C11, C12, and C44. With care in sample preparation, and appropriate
corrections for bond thickness and diffraction, the sound velocities can be determined to parts
in 10^. The accuracy of a velocity determination is usually limited by the accuracy of the path
length measurement in pulsed and continuous wave techniques. With optical techniques,
deflection angles, wavelength measurements, and frequencies are determinant factors.
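As an illustration of how measured velocities translate into elastic constants, the sketch below inverts the three pure-mode speeds along [110] of a cubic crystal. It assumes the standard relations rho0 v^2 = (C11 + C12 + 2C44)/2 for the longitudinal mode, (C11 - C12)/2 for the shear mode polarized along [1-10], and C44 for the shear mode polarized along [001]. The function name and the roughly silicon-like numbers are hypothetical and serve only as a round-trip check.

```python
import math


def cubic_constants_from_110(v_long, v_trans_1m10, v_trans_001, rho0):
    """Invert the three [110] pure-mode speeds for (C11, C12, C44)."""
    c44 = rho0 * v_trans_001 ** 2                         # shear, polarized along [001]
    c11_minus_c12 = 2.0 * rho0 * v_trans_1m10 ** 2        # shear, polarized along [1-10]
    c11_plus_c12 = 2.0 * rho0 * v_long ** 2 - 2.0 * c44   # longitudinal
    c11 = 0.5 * (c11_plus_c12 + c11_minus_c12)
    c12 = 0.5 * (c11_plus_c12 - c11_minus_c12)
    return c11, c12, c44


# Round-trip check with illustrative inputs (Pa, kg/m^3).
rho0 = 2329.0
true_c11, true_c12, true_c44 = 165.7e9, 63.9e9, 79.6e9
v_l = math.sqrt((true_c11 + true_c12 + 2.0 * true_c44) / (2.0 * rho0))
v_t1 = math.sqrt((true_c11 - true_c12) / (2.0 * rho0))
v_t2 = math.sqrt(true_c44 / rho0)

c11, c12, c44 = cubic_constants_from_110(v_l, v_t1, v_t2, rho0)
print(c11 / 1e9, c12 / 1e9, c44 / 1e9)
```

Because each constant scales as rho0 v^2, a fractional error in a velocity appears roughly doubled in the corresponding combination of elastic constants, which is one reason the velocity accuracy matters so much.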

In general, there is little problem in obtaining a sound velocity measurement with an
uncertainty on the order of 1%. But to improve on this, one must exercise additional care in
the preparation of surfaces, the control of temperature, the determination of acoustic path
length, and the determination of the travel time of the acoustic wave. For single transducer
configurations, where one transducer is used to send and receive the acoustic wave, the basic
equation for determining the velocity using pulsed (pulse-echo) techniques, without
corrections, is as follows:

$$v = \frac{2L}{T} , \qquad (39)$$

where $v$ is the wave velocity, $L$ is the sample length, and $T$ is the round-trip time.
Corrections are made in the evaluation of $T$, and vary with the technique used in the
measurement. Corrections for bond thickness (McSkimin, 1961) and for diffraction effects
(Papadakis, 1961) can be measured and/or calculated. Accuracies of some of the more frequently
used pulsed techniques are given in Table 1. A comparison of the accuracies of various
techniques is discussed by Papadakis (1972, 1976). For continuous wave techniques, with the
propagating plane wave model, the basic equation without corrections is

$$v = 2L\,\Delta f , \qquad (40)$$

where $\Delta f$ is the frequency difference between two adjacent mechanical resonance modes.
Typically, velocity measurement accuracy is good to about 1% to 10%. A correction factor
(Chern, 1981) can be applied to the right side of Eq. (40) that corrects for the effects of the
transducer and bond on the mechanical resonances of the sample. When applied, the inaccuracy
can be as low as 5 parts per 100,000, neglecting inaccuracies in path length measurements.
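The two uncorrected relations can be written out as a short sketch, reading the pulse-echo equation as v = 2L/T (T being the round-trip time) and the continuous-wave equation as v = 2L times the resonance spacing. The function names and the sample numbers are hypothetical, and no bond or diffraction corrections are included.

```python
def pulse_echo_velocity(length_m, round_trip_s):
    """Uncorrected pulse-echo velocity from sample length and round-trip time."""
    return 2.0 * length_m / round_trip_s


def cw_velocity(length_m, delta_f_hz):
    """Uncorrected continuous-wave velocity from adjacent-resonance spacing."""
    return 2.0 * length_m * delta_f_hz


# Hypothetical 10 mm sample with a 4 microsecond round trip.
v_pulse = pulse_echo_velocity(0.010, 4.0e-6)
# The same sample would show resonances spaced by v / (2L) = 250 kHz.
v_cw = cw_velocity(0.010, 250.0e3)
print(v_pulse, v_cw)
```

Both forms share the factor 2L, so an error in the path length propagates identically into either velocity; the techniques differ only in whether a time or a frequency spacing is measured.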

If the sample is transparent, optical techniques offer a convenient method to measure sound
velocity. For some of these techniques, the accuracy is on the order of parts per thousand.
Still, some offer accuracies of parts per ten thousand and can be used with small samples and
very high frequency ultrasonic waves (Breazeale et al., 1981). In general the calculation for
velocity involves the determination of acoustic wavelength in the medium, by using the equation

$$v = f\lambda , \qquad (41)$$

where $f$ is the frequency of the sound wave, determined by the drive frequency of the
transducer, and $\lambda$ is the wavelength. Brillouin scattering also makes it possible to
measure the sound velocity (Beyer and Letcher, 1969). In addition to measurement of an angle,
the accuracy also depends on determination of the wavelength of light in the medium; this
requires a determination of the index of refraction of the medium.

5.  ULTRASONIC  MEASUREMENT  TECHNIQUES 

A survey of techniques and details for measuring ultrasonic velocities is available from
various sources (Breazeale et al., 1981; Papadakis, 1976; Truell et al., 1969). Consideration
in this work will be given to three classes of techniques for absolute velocity measurements.
A compilation of some of the features of each technique is listed in Table 2.

a.  Sample  Preparation 

The tolerance selected for preparing the sample depends largely upon the accuracy needed in
the determination. If a 1% measurement is in order, surface preparation and parallelism are
less critical than if an accuracy of 0.01% is desired. Also important is the consideration of
correction factors needed to compensate for bond thickness and diffraction effects. At
present, diffraction corrections exist only for compressional waves. Therefore, under
equivalent experimental conditions, the most accurate determinations of combinations of elastic
coefficients would be those that are calculated from compressional wave velocities.

It is assumed that a surface on the sample has been ground to optical tolerances, and that the
crystallographic direction of the axis of this surface has been determined. Typically, the
crystallographic directions are measured to within minutes of arc (McSkimin and Andreatch,
1964). The remaining critical issues are the parallelism of the reflecting surface to the
reference surface, and the flatness of these surfaces. Parallelism of the surfaces can be
measured, for example, with an autocollimator or a He-Ne gas laser and some mirrors. By placing
the sample's reference surface on a stationary flat, dust-free surface, rotating the sample,
and measuring both the diameter of the circle traced by the beam reflected from the sample's
top surface and the total path length of the beam from the sample to the image screen, one can
determine the parallelism of the surfaces. The surfaces should be parallel to an angle better
than 0.01 $\lambda_{\mathrm{acoustic}}$/(transducer diameter). For a typical 1/2 inch diameter,
10 MHz transducer and a typical solid, the surfaces must be parallel to better than
4 x 10^-4 radians (about 1 minute, 23 seconds of arc). Higher frequencies, multiple
reflections, and continuous wave techniques require proportionally smaller tolerances.

Typically, one also tries for a flatness of better than 1/100 of an acoustic wavelength for
accurate determination of transit times. If the resonant frequency of the transducer is 10 MHz,
then the wavelength in a solid is in the neighborhood of 5 x 10^-4 meters, which puts the
flatness requirement near 5 micrometers. Using an optical flat and Newton's rings analysis, one
can determine the flatness of the sample. For higher frequency transducers and continuous wave
techniques, the number of fringes is appropriately decreased. At 100 MHz, pulsed mode, for
example, the tolerance is down to less than several fringes. By using flat lapping surfaces,
this is easily achieved. When multiple reflections are involved in the measurement, one must
consider that for each reflection, changes occur in the wavefront direction; the phase change
across the surface of the transducer at each reflection is due to lack of parallelism between
the faces. For a case where a 20 MHz transducer was used, and multiple reflections employed for
the measurement, Papadakis (1967) quotes a sample surface parallelism and flatness of better than
10^  inches  per  inch. 
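The parallelism and flatness rules of thumb quoted in this subsection are easy to evaluate numerically. The sketch below assumes a sound speed of 5000 m/s as a stand-in for "a typical solid" (an assumption, not a value given in the text), and applies the criteria of 0.01 wavelength per transducer diameter for parallelism and 1/100 wavelength for flatness. The function names are hypothetical.

```python
import math


def acoustic_wavelength(velocity_m_s, frequency_hz):
    """Acoustic wavelength in the sample."""
    return velocity_m_s / frequency_hz


def parallelism_tolerance_rad(wavelength_m, transducer_diameter_m):
    """Rule of thumb: 0.01 * acoustic wavelength / transducer diameter."""
    return 0.01 * wavelength_m / transducer_diameter_m


def flatness_tolerance_m(wavelength_m):
    """Rule of thumb: 1/100 of an acoustic wavelength."""
    return wavelength_m / 100.0


velocity = 5000.0            # m/s, assumed typical solid
frequency = 10.0e6           # Hz
diameter = 0.5 * 0.0254      # 1/2 inch transducer, in meters

wavelength = acoustic_wavelength(velocity, frequency)
angle_rad = parallelism_tolerance_rad(wavelength, diameter)
angle_arcsec = math.degrees(angle_rad) * 3600.0
flatness = flatness_tolerance_m(wavelength)
print(wavelength, angle_rad, angle_arcsec, flatness)
```

Under these assumptions the parallelism tolerance is close to 4 x 10^-4 radians, on the order of 80 seconds of arc, and the flatness tolerance is about 5 micrometers; both shrink in proportion to the wavelength as the frequency is raised.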

b.  Piezoelectric Transducers

Selection of the piezoelectric transducer depends largely upon the wave mode, the electronic
equipment to be used for the measurement, and the personal preference of the researcher. In
general, piezoelectric crystals such as quartz (high electrical impedance) and lithium niobate,
or poled ceramics such as PZT (lead zirconate titanate), are chosen. Their physical and
electrical properties are covered elsewhere (O'Donnell et al., 1981). Other piezoelectric
transducers, including polymeric materials, are also available. The transducer diameter should
be smaller than the sample to assure that propagation modes are not affected by the location of
lateral boundaries (Tu et al., 1955). This is especially important where measurements are taken
on small samples which may have their lateral sides close to each other. One also must consider
the problem as it relates to wave propagation, since wave modes other than those considered
here can be excited in materials of small dimensions. From studies on cylindrical specimens,
the minimum sample diameter can be no less than approximately 2.5 $\lambda$, where $\lambda$ is
the wavelength of the ultrasound in the medium. The transducer diameter is chosen after the
frequency is selected. Generally, the transducer diameter should be no larger than half the
diameter of the sample, to prevent any interference in the measurement from reflections off the
lateral boundaries caused by diffraction effects.

Selection of a bonding material depends upon the type of wave (compressional or shear) and the
temperature range through which the measurements are to be taken. At room temperature, a good
choice for both types of waves is Salol (phenyl salicylate) or phenyl benzoate (Papadakis,
1964), while Dow-Corning DC-200 silicone is good for longitudinal waves. At other temperatures
(McSkimin, 1957), other bonding materials, such as Nonaq stopcock grease, are used for
compressional and shear waves. Various resins can be used with shear wave transducers;
commercially prepared bonding materials are also available.

Application of the bond requires care to keep surfaces clean and free of dust. The bond
should  be  as  thin  as  practical,  taking  care  not  to  break  the  transducer.  Often,  applying  some  heat 
helps  with  viscous  bonding  materials.  When  using  Salol  or  phenyl  benzoate  for  bonds,  certain 
procedures  can  be  followed  to  assure  uniform  bonds  (Papadakis,  1964). 

c.  Noncontacting Transducers

There are three types of noncontacting transducers: capacitive transducers (compressional wave
excitation only) (Cantrell and Breazeale, 1977), electromagnetic transducers (EMAT) (Vasile and
Thompson, 1977; Johnson and Mase, 1984), and optically stimulated acoustic transducers (Prosser
and Green, 1985). With the exception of the electrostatic transducer (compressional with
circular piston geometry), diffraction correction data have not been developed. However, bond
corrections are not necessary, as these methods generate the wave directly on the sample
surface. Generally, the signal levels are quite small, and require high gain and specialized
circuits or devices to bring the signals to usable levels.


6.  CORRECTIONS  TO  PULSE  MEASUREMENTS:  AN  EXAMPLE 

With some pulsed systems, and with care in preparing the sample and in taking the
measurements, one can expect round-trip time determinations to have standard deviations in the
neighborhood of one hundred picoseconds. As corrections for bond thickness and diffraction are
often larger than this, it is necessary to correct for these sources of systematic error where
possible. As an example, consider, for a properly prepared sample, that the pulse-echo overlap
technique is to be used to measure the round-trip time for a tone burst. The time between the
first-received and the second-received echoes will be determined. For simplicity, we will treat the case
of only one round trip in the sample. The measured time, $T$, is composed of several terms: the
true travel time, $\delta$, the bond thickness contribution, $\Delta^b$, and the diffraction contribution, $\Delta^d$.


a.  Bond  Thickness 

The  model  for  the  sample-bonded  transducer  system  has  been  developed  by  Williams  and 
Lamb  (1958),  and  has  been  used  extensively  by  McSkimin  (1961)  and  by  Papadakis  (1967). 

They  show  that  by  treating  the  transducer  bond  sample  system  as  an  acoustic  transmission  line,  it 
is  possible  to  calculate  the  effects  of  bond  thickness  on  the  measurement  of  transit  time. 
McSkimin  (1961)  shows  that  by  measuring  the  change  in  the  transit  time  when  the  system  is 
detuned  by  a  known  amount,  it  is  possible  to  determine  the  bond  thickness  in  terms  of  measured 
quantities.  This  permits  one  to  correct  for  the  transit  time  through  the  bond  material.  Papadakis 
(1967)  discusses  the  correction  in  some  detail,  and  applies  it  to  his  measurements  on  fused  quartz 
and  silicon. 

Consider  an  undamped  transducer  bonded  to  one  end  of  a  sample  with  some  bonding 
material  of  known  velocity  and  density;  a  relatively  long  wave  is  reflected  from  the  interface 
between  the  sample  and  the  transducer  bond  system.  The  reflected  wave  from  this  interface 
experiences  a  phase  shift  from  the  impedance  mismatch,  which  can  be  calculated  from  the 
model. The calculation uses the real and imaginary parts of the effective impedance of the
components to obtain the contribution to the phase shift. By detuning the frequency of the applied
tone  burst,  we  can  measure  a  corresponding  change  in  transit  time.  Then,  a  comparison  with  the 
measured values permits the calculation of the bond thickness, so that its effect on the
measurement can be determined. The approach is outlined below.

The reflection coefficient for the system described above is given by

$$\frac{E_b}{E_i} = \frac{Z_d - Z_s}{Z_d + Z_s} , \qquad (42)$$


where $E_b$ is the reflected pressure wave, $E_i$ is the incident pressure wave, $Z_s$ is the specific
acoustic impedance of the sample ($Z_s = \rho c$, where $\rho$ is the mass density and $c$ is the wave
propagation speed), and $Z_d$ is the effective specific acoustic impedance of the transducer-bond system,
given by McSkimin (1961),


$$Z_d = j Z_1 \, \frac{(Z_1/Z_2) \tan k_1 \ell_1 + \tan k_2 \ell_2}{(Z_1/Z_2) - \tan k_2 \ell_2 \, \tan k_1 \ell_1} , \qquad (43)$$


where $Z_1$ and $Z_2$ are the specific acoustic impedances of the bond material and transducer
material, respectively. Using similar conventions, $k_1$ and $k_2$ are the propagation constants in the
respective materials, $\ell_1$ and $\ell_2$ are the respective bond and transducer thicknesses, and $j = \sqrt{-1}$.

The phase angle (McSkimin, 1961) is calculated by writing Eq. (42) in complex polar
form, and gives

$$\gamma = \arctan\left( \frac{-2 |Z_d| Z_s}{|Z_d|^2 - Z_s^2} \right) , \qquad (44)$$

where $|Z_d|$ is the modulus of $Z_d$, and $(\pi + \gamma)$ is the phase shift between the reflected and incident
pressure waves impinging on the interface of the transducer-bond system with the sample.

When the system is driven at the transducer resonant frequency, $f_R$, the phase angle can be
calculated by using the fact that the transducer at fundamental resonance has a thickness of one half of a
wavelength; then $|Z_d|$ becomes $Z_1 \tan k_1 \ell_1$, which is generally quite small for a thin bond.
Off-resonance drive frequencies, generally chosen as $0.9 f_R$, give a larger phase angle, since
$Z_d$ depends strongly on the drive frequency away from resonance. As outlined in McSkimin
(1961) and Papadakis (1967), we can calculate the time difference between phase-matched echo
trains, $\Delta T$, caused by the change of drive frequency, and compare it with experimental results to
determine the correct condition for overlap.
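
As a rough numerical illustration of Eqs. (43) and (44), the following Python sketch evaluates the effective impedance of a lossless transducer-bond system and the resulting phase angle, both at the transducer resonance and at 0.9 of the resonant frequency. The function names and every material value used (bond and transducer impedances and velocities, bond thickness, sample impedance) are assumptions chosen only for illustration; they are not taken from McSkimin (1961) or Papadakis (1967).

```python
import math

def reactance_zd(f, z1, z2, v1, v2, l1, l2):
    """Return X = Z_d / j from Eq. (43) for a lossless bond (1) and transducer (2)."""
    t1 = math.tan(2.0 * math.pi * f * l1 / v1)  # tan(k1 * l1), bond layer
    t2 = math.tan(2.0 * math.pi * f * l2 / v2)  # tan(k2 * l2), transducer
    ratio = z1 / z2
    return z1 * (ratio * t1 + t2) / (ratio - t2 * t1)

def phase_angle(x, zs):
    """Phase angle gamma (radians) as written in Eq. (44), with |Z_d| = |x|."""
    mod = abs(x)
    return math.atan(-2.0 * mod * zs / (mod * mod - zs * zs))

# Hypothetical example values in SI units (assumptions, for illustration only).
F_RES = 20.0e6                       # transducer fundamental resonance, Hz
Z1, V1, L1 = 2.5e6, 2500.0, 1.0e-6   # bond impedance, velocity, thickness
Z2, V2 = 15.2e6, 5740.0              # transducer impedance and velocity
L2 = V2 / (2.0 * F_RES)              # transducer is half a wavelength thick
ZS = 19.7e6                          # sample impedance

x_res = reactance_zd(F_RES, Z1, Z2, V1, V2, L1, L2)
x_off = reactance_zd(0.9 * F_RES, Z1, Z2, V1, V2, L1, L2)
gamma_res = phase_angle(x_res, ZS)
gamma_off = phase_angle(x_off, ZS)

if __name__ == "__main__":
    print(f"gamma at resonance:     {gamma_res:.4f} rad")
    print(f"gamma at 0.9 resonance: {gamma_off:.4f} rad")
```

With these assumed values the phase angle at resonance is small and the off-resonance angle is considerably larger, which is the behavior the detuning procedure relies on.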
Assume an alignment of, for example, the first and second echoes in the sample. The
change in the measured time between echoes is given by


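$$\Delta T = \frac{1}{f_L} \left( n - \frac{p \gamma_L}{2\pi} \right) - \frac{1}{f_R} \left( n - \frac{p \gamma_R}{2\pi} \right) , \qquad (45)$$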


where $p$ is the number of round trips (for our case, $p = 1$) of the ultrasonic tone burst in the
sample, $n$ is the number of cycles of mismatch in the tone burst, $f_L$ is the off-transducer-resonance
drive frequency, $f_R$ is the drive frequency at transducer resonance, $\gamma_L$ is the phase angle at the
off-resonance frequency (radians), and $\gamma_R$ is the phase angle (radians) at the transducer resonance.
One can calculate $\Delta T$ for the case of no cycle mismatch ($n = 0$) and no bond thickness ($\ell_1 = 0$)
from Eqs. (43) through (45). The measured value closest to the calculated value determines the
correct experimental cycle-for-cycle match. Following the experimental identification of the
correct match, the measured $\Delta T$ can be used to determine the correction for bond thickness by
adjusting $\ell_1$ in Eq. (43) and solving Eq. (44) to bring Eq. (45) into agreement with the measured
$\Delta T$ values. The phase angle at resonance can then be determined, and the travel time correction for
bond thickness, $\Delta^b$, is given by


(46) 


where $\gamma_R$ and $f_R$ are the phase angle and drive frequency, respectively, both taken at transducer
resonance.
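
The cycle-matching step can be sketched in Python as follows. This is a minimal illustration that assumes the change in measured time has the form usually attributed to McSkimin (1961), $\Delta T = (n - p\gamma_L/2\pi)/f_L - (n - p\gamma_R/2\pi)/f_R$. The function names, the drive frequencies, and the phase angles in the example are hypothetical.

```python
import math

def delta_t(n, p, f_l, f_r, gamma_l, gamma_r):
    """Change in measured echo spacing when the drive is detuned from f_r to f_l."""
    two_pi = 2.0 * math.pi
    return (n - p * gamma_l / two_pi) / f_l - (n - p * gamma_r / two_pi) / f_r

def closest_overlap(measured, ideal):
    """Index of the measured delta-T value that lies closest to the ideal value."""
    return min(range(len(measured)), key=lambda i: abs(measured[i] - ideal))

# Hypothetical numbers: resonance at 20 MHz, detuned drive at 18 MHz, one round trip.
F_R, F_L, P = 20.0e6, 18.0e6, 1

# Ideal case: no cycle mismatch and no bond (assumed phase angles 0.40 and 0.0 rad).
ideal = delta_t(0, P, F_L, F_R, 0.40, 0.0)

# Trial overlaps with n = -1, 0, +1 cycles of mismatch and a thin bond
# (assumed phase angles 0.44 and 0.013 rad).
candidates = [delta_t(n, P, F_L, F_R, 0.44, 0.013) for n in (-1, 0, 1)]
best = closest_overlap(candidates, ideal)

if __name__ == "__main__":
    print("ideal delta-T (ns):", ideal * 1e9)
    print("candidates (ns):", [c * 1e9 for c in candidates])
    print("best candidate index:", best)
```

A wrong overlap by one cycle shifts $\Delta T$ by $1/f_L - 1/f_R$, which is several nanoseconds at these frequencies and therefore easy to distinguish from the bond contribution.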


b.  Diffraction  Corrections 

Diffraction effects for compressional waves in various crystalline symmetries have been
treated by several investigators (Seki et al., 1956; Papadakis, 1963, 1964, 1966), while others
have treated isotropic media (Benson and Kiyohara, 1974; Khimunin, 1972; Rogers and Van
Buren, 1974). Papadakis (1972) has shown that without diffraction corrections, one can expect
errors in travel times as large as $0.25/f_R$. He also discusses (1972) diffraction corrections for the
pulse-echo overlap technique. Using the dimensionless quantity
$$S = \frac{z \lambda}{a^2} , \qquad (47)$$

where $S$ is the Seki parameter, $a$ is the transducer radius, $\lambda$ is the wavelength in the sample, and $z$
is the distance of propagation, he writes the phase correction due to diffraction, $\Delta^d$, as

$$\Delta^d = \frac{\phi(S_n) - \phi(S_m)}{2 \pi f} , \qquad (48)$$


where $\phi$ is the phase shift in radians of the received wave front that is due to diffraction effects.
For the example, the sample is of length $L$, which gives $S_n = 2L\lambda/a^2$ and $S_m = 4L\lambda/a^2$.

The true travel time, $\delta$, can be written in terms of the measured time, $T$, and the
corrections,

$$\delta = T + \Delta^b + \Delta^d . \qquad (49)$$

The thickness of the sample, $L$, is measured by conventional means, such as a high-precision
micrometer, and the velocity of sound is given by

$$v = \frac{2L}{\delta} . \qquad (50)$$
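
The bookkeeping of Eqs. (47), (49), and (50) is simple enough to sketch in a few lines of Python. The function names and the sample dimensions below are assumptions for illustration; the bond and diffraction corrections are passed in as already-determined numbers.

```python
def seki_parameter(z, wavelength, radius):
    """Dimensionless Seki parameter S = z * lambda / a**2, Eq. (47)."""
    return z * wavelength / radius ** 2

def sound_velocity(length, measured_time, bond_corr=0.0, diffraction_corr=0.0):
    """Velocity from Eqs. (49) and (50): delta = T + corrections, v = 2L / delta."""
    delta = measured_time + bond_corr + diffraction_corr
    return 2.0 * length / delta

if __name__ == "__main__":
    # Hypothetical sample: 10 mm long, 4 microsecond measured round-trip time.
    print(sound_velocity(0.01, 4.0e-6))
    # Same measurement with an assumed -1 ns bond correction.
    print(sound_velocity(0.01, 4.0e-6, bond_corr=-1.0e-9))
    # Seki parameter for the first echo (z = 2L), assumed 0.25 mm wavelength, 5 mm radius.
    print(seki_parameter(0.02, 2.5e-4, 5.0e-3))
```

A correction of one nanosecond on a four-microsecond round trip changes the velocity by about 0.025%, which is why the corrections matter at the precision quoted above.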


7.  OPTICAL  TECHNIQUES 

Optical  techniques  have  been  used  with  success  in  the  measurement  of  sound  velocities  in 
transparent  media.  The  methods  include  diffraction  and  scattering  of  light  by  sound  waves.  A 
direct  method  of  optically  measuring  an  acoustic  wavelength  in  the  material  can  also  be  used. 

The diffraction techniques considered here permit the determination of the wavelength of
sound. The wavelength, $\lambda^*$, is combined with the ultrasonic frequency, $f$, of the sound beam to
calculate its velocity, according to the expression

$$\lambda^* f = v , \qquad (51)$$

where $v$ is the speed of sound in the sample. In any optical technique, however, one must specify
the type of diffraction experienced by the light.

a. Fraunhofer Diffraction of Light by Sound

Consider an ultrasonic transducer, bonded to a transparent solid, generating ultrasonic
compressional waves that propagate into the solid. Further suppose that the sound wave
encounters light traversing the same medium so that the light is diffracted. There are two different
physical regimes that can produce the diffraction effects. One involves the formation of a
corrugation in the phase fronts of the light that is due to the spatial variation in the index of refraction
of the solid, as caused by the sound waves. This is called Raman-Nath diffraction (Raman and
Nath, 1935, 1935a, 1936, 1936a, 1936b; Born and Wolf, 1970). The second regime involves the
reflection of light from the evenly spaced crests of the sound waves. These reflections occur
under some conditions that are similar to x-ray diffraction by a crystal lattice; this is called Bragg
diffraction (Bhatia and Noble, 1953). Both types of diffraction can be used to determine the
sound velocity of the material.

In order to determine which type of diffraction effect predominates (Nomoto, 1942), a
dimensionless parameter (Klein et al., 1965), $Q$, is defined as

$$Q = \frac{K^{*2} L}{\mu_0 K} , \qquad (52)$$

where $K^*$ is the ultrasonic propagation constant, $L$ is the width of the ultrasonic beam, $\mu_0$ is the
index of refraction, and $K$ is the propagation constant of light in vacuum. If $Q > 9$, one has
Bragg diffraction. If $Q < 1$, Raman-Nath diffraction occurs. For $1 < Q < 9$, the diffraction is
mixed. For illustrative purposes, consider a typical transparent solid ($c_{sound} \approx 5 \times 10^3$ m/sec,
index of refraction $\approx 1.5$, and ultrasonic beam width $\approx 1.27 \times 10^{-2}$ m), illuminated with light at
wavelength 632.8 nm. If the ultrasonic frequency is approximately 27 MHz or less, the
interaction satisfies Raman-Nath diffraction conditions. If the frequency is greater than approximately
81 MHz, the interaction is governed by Bragg diffraction conditions.
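
The regime boundaries quoted for the illustrative solid can be checked with a short Python sketch. It assumes $Q = K^{*2} L / (\mu_0 K)$ with $K^* = 2\pi f / c_{sound}$ and $K = 2\pi/\lambda$, a beam width of $1.27 \times 10^{-2}$ m, and the other values given in the text; the function names are hypothetical.

```python
import math

def klein_cook_q(freq_hz, sound_speed, index, beam_width, light_wavelength_vac):
    """Q = K*^2 L / (mu_0 K), with K* = 2 pi f / c_sound and K = 2 pi / lambda_vac."""
    k_star = 2.0 * math.pi * freq_hz / sound_speed
    k_light = 2.0 * math.pi / light_wavelength_vac
    return k_star ** 2 * beam_width / (index * k_light)

def regime(q):
    """Classify the diffraction regime from Q using the thresholds 1 and 9."""
    if q < 1.0:
        return "Raman-Nath"
    if q > 9.0:
        return "Bragg"
    return "mixed"

def frequency_for_q(q, sound_speed, index, beam_width, light_wavelength_vac):
    """Ultrasonic frequency at which the parameter reaches the given Q."""
    return sound_speed * math.sqrt(
        q * index / (2.0 * math.pi * beam_width * light_wavelength_vac)
    )

# Values of the illustrative transparent solid described in the text.
SOLID = dict(sound_speed=5.0e3, index=1.5, beam_width=1.27e-2,
             light_wavelength_vac=632.8e-9)

if __name__ == "__main__":
    for f in (20e6, 27e6, 50e6, 81e6, 100e6):
        q = klein_cook_q(f, **SOLID)
        print(f"{f / 1e6:6.1f} MHz  Q = {q:6.3f}  {regime(q)}")
    print("Q = 1 at", frequency_for_q(1.0, **SOLID) / 1e6, "MHz")
    print("Q = 9 at", frequency_for_q(9.0, **SOLID) / 1e6, "MHz")
```

With these inputs $Q$ is close to 1 near 27 MHz and close to 9 near 81 MHz, in agreement with the frequencies stated above.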
b. Raman-Nath Diffraction

For the case of Raman-Nath diffraction, we consider the light beam impinging on the
medium at an angle, $\phi$, measured from the normal to the sound beam. The locations of the diffraction
orders are given by the expression

$$\sin(\theta_n + \phi) - \sin(\phi) = \frac{n \lambda}{\lambda^*} , \qquad (53)$$

where $\phi$ is the angle of incidence, $\theta_n$ is the angle of diffraction, $\lambda$ is the wavelength of light in the
medium, $\lambda^*$ is the ultrasonic wavelength, and $n$ is an integer. One can experimentally set $\phi = 0$,
which reduces Eq. (53) to

$$\sin(\theta_n) = \frac{n \lambda}{\lambda^*} . \qquad (54)$$

The ultrasonic wavelength can be determined from Eq. (53). With the measurement of the
ultrasonic frequency, one can determine the sound velocity of the compressional wave (Barnes and
Hiedemann, 1957). In determining the wavelength and its uncertainty, it is necessary
to make an analysis of the optical setup, including the effects of Snell's Law at interfaces.
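
For normal incidence, Eq. (54) together with $\lambda^* f = v$ gives the velocity directly from a measured diffraction angle. The Python sketch below is a minimal illustration with hypothetical function names; it takes the light wavelength in the medium as an input and leaves out the refraction corrections at the sample faces mentioned above.

```python
import math

def ultrasonic_wavelength(order, light_wavelength, theta_n):
    """Ultrasonic wavelength from Eq. (54): sin(theta_n) = n * lambda / lambda*."""
    return order * light_wavelength / math.sin(theta_n)

def raman_nath_velocity(freq_hz, order, light_wavelength, theta_n):
    """Sound velocity v = lambda* f for the n-th diffraction order at normal incidence."""
    return freq_hz * ultrasonic_wavelength(order, light_wavelength, theta_n)

if __name__ == "__main__":
    # Synthetic check: build the first-order angle for an assumed 5000 m/s solid
    # driven at 10 MHz, then recover the velocity from that angle.
    light_wavelength = 632.8e-9 / 1.5       # assumed wavelength in the medium
    true_wavelength = 5000.0 / 10.0e6       # ultrasonic wavelength, m
    theta_1 = math.asin(light_wavelength / true_wavelength)
    print(raman_nath_velocity(10.0e6, 1, light_wavelength, theta_1))
```

The first-order angle in this synthetic case is below one milliradian, so the angular measurement dominates the uncertainty of the result.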


c.  Bragg  Diffraction 

For the case of Bragg diffraction, the angle of incidence, $\phi$, is set to the angle of
diffraction, and

$$n \lambda = 2 \lambda^* \sin\theta_B , \qquad (55)$$

where $\theta_B$ is the Bragg angle. Bragg diffraction is used to measure wave velocities in the
frequency range from approximately 100 MHz to the low end of the gigahertz scale. For example
(Krischer, 1968), the technique has been used to measure wave velocity to an estimated accuracy
of better than 0.1%. It is also useful in measuring the local velocities within a sample (Simondet
et al., 1976; Michard and Perrin, 1978). Measurements with precision of better than 0.01% in
homogeneous samples have been reported (Simondet et al., 1976).

d.  Direct  Measurements  (Fresnel  Diffraction) 

Consider a standing ultrasonic wave in a sample through which collimated light is passed
so that the collimated light beam is perpendicular to the sound beam. A measuring microscope
or similar optical device is focused so that the image of the wave can be viewed at the instrument
focal plane. Because the images of the wave fronts are $\lambda^*/2$ apart, it is possible to determine the
ultrasonic velocity from measurements of ultrasonic frequency and measurements of the
wavelength in the medium. The technique is sensitive enough to detect local variations in velocity
greater than 0.01% (Hiedemann and Hoesch, 1934, 1937; Mayer and Hiedemann, 1958, 1959).

e. Brillouin Scattering

Consider the scattering of a photon by a high-frequency phonon traveling in a specific
direction within a crystal. Application of conservation of energy and momentum to the
scattering, coupled with the approximation that any photon frequency shift is small (Benedek and
Fritsch, 1966; Beyer and Letcher, 1969, pp. 47-50), gives

$$\nu^* = \pm 2 \nu \, \frac{v}{c} \, \sin\left( \frac{\theta}{2} \right) , \qquad (56)$$

where $\nu$ is the photon frequency, $\nu^*$ is the phonon frequency as well as the difference in
frequency of the scattered photons, $c$ is the speed of the photon in the medium, $v$ is the speed of the
phonon, and $\theta$ is the scattering angle. If desirable, the Bragg condition can be used for
constructive reinforcement by adjusting $\theta$ (Pollard, 1965). The technique gives three lines in the
scattered photon spectrum, which can be separated and measured with appropriate optical devices, such
as Fabry-Perot interferometers, to obtain $\nu^*$. Eq. (56) can be solved for $v$ in terms of the other
quantities. Brillouin scattering is useful in the investigation of the sound velocity of a material
near a phase transition (Fleury, 1970, pp. 37-42). The technique has been used for both
longitudinal and mixed modes. However, in one study on cubic crystals, no Brillouin scattering was
observed from an acoustic branch which consisted of pure transverse waves (Benedek and
Fritsch, 1966). Uncertainties in the index of refraction, the scattering angle, and the width of the
Stokes and anti-Stokes lines influence the accuracy of the determination. The precision is
considerably better, however, with values of 0.1% as mentioned.
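
Solving Eq. (56) for the phonon speed gives $v = |\nu^*| c / [2 \nu \sin(\theta/2)]$. The Python sketch below illustrates this inversion on synthetic numbers; the function names, the refractive index, the scattering angle, and the assumed sound speed are hypothetical.

```python
import math

C_VACUUM = 2.99792458e8  # speed of light in vacuum, m/s

def brillouin_shift(nu_photon, v_phonon, c_medium, theta):
    """Magnitude of the Brillouin frequency shift from Eq. (56)."""
    return 2.0 * nu_photon * (v_phonon / c_medium) * math.sin(theta / 2.0)

def phonon_speed(nu_shift, nu_photon, c_medium, theta):
    """Phonon speed obtained by solving Eq. (56) for v."""
    return abs(nu_shift) * c_medium / (2.0 * nu_photon * math.sin(theta / 2.0))

if __name__ == "__main__":
    index = 1.5                              # assumed refractive index
    nu = C_VACUUM / 632.8e-9                 # photon frequency for 632.8 nm light
    c_medium = C_VACUUM / index              # speed of light in the medium
    theta = math.pi / 2.0                    # assumed 90-degree scattering geometry
    shift = brillouin_shift(nu, 6000.0, c_medium, theta)
    print("shift (GHz):", shift / 1e9)
    print("recovered v (m/s):", phonon_speed(shift, nu, c_medium, theta))
```

For these assumed values the shift is of the order of 20 GHz, which is the range where Fabry-Perot interferometers are typically used to resolve the Stokes and anti-Stokes lines.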

In  stimulated  Brillouin  scattering,  the  photon  scattering  process  is  dependent  upon  the 
intensity  of  the  radiation  striking  the  surface.  With  high  enough  light  intensity,  nonlinear  effects 
occur,  which  result  in  scattering  by  frequencies  and  harmonics  created  by  harmonic  generation 
(Breuer,  1965).  A  large  buildup  of  acoustic  intensity,  both  compressional  and  shear  waves, 
accompanies  a  threshold  in  optical  intensity  (Chiao  et  al.,  1964).  Other  effects  include  the  pos¬ 
sibility  of  sample  destruction  from  the  intense  radiation,  and  the  line  pulling  effects  of  the  laser 
cavity  on  the  scattered  light,  which  affects  accuracy  (Fleury,  1970,  p.  57-58).  Amplified  acoustic 
frequencies  have  been  reported  as  high  as  60  GHz. 
III. THEORETICAL AND EXPERIMENTAL RESULTS


8. AB INITIO THEORY

It is clear from Section 2 that the calculation of elastic constants requires an accurate
computation of the variation in the total energy of a solid, from equilibrium to distorted
configurations. Thanks to the availability of powerful computers, great advances in ab initio
total-energy calculation have been made in recent years. In this section we briefly discuss the
approximations used in ab initio theory and summarize the calculated results.

The Hamiltonian of a solid consists of five parts: the kinetic energies of the ions and
electrons, $K_i$ and $K_e$ respectively, and the potential energies $U_{ii}$ among ions, $U_{ee}$ among
electrons, and $U_{ei}$ between electrons and ions. The Born-Oppenheimer (1927) adiabatic
approximation is a simple way to separate the electronic from the ionic variables. In this approximation,
because the ion speed is at least two orders of magnitude smaller than the electron speed, one freezes the ionic
motion in a configuration specified by a set of ion position vectors $\{\mathbf{R}_n\}$, then solves the
Schrödinger equation for that part of the Hamiltonian that involves the set of electronic
coordinates $\{\mathbf{r}_i\}$:

$$H \psi_\gamma(\{\mathbf{r}_i\}, \{\mathbf{R}_n\}) = E_\gamma(\{\mathbf{R}_n\}) \, \psi_\gamma(\{\mathbf{r}_i\}, \{\mathbf{R}_n\}) , \qquad (57)$$

where $H = K_e + U_{ee} + U_{ei}$. Thus the energy $E_\gamma$ is a function of the ionic configuration. The
lowest energy curve of the sum $E_g = E_\gamma + U_{ii}$ as a function of $\{\mathbf{R}_n\}$ then serves as the potential
energy for the ionic motion. A Taylor expansion of $E_g$ about its minimum value $E_0$, i.e., the
quantum mechanical analog to Eq. (17), takes the form

$$E_g(\{\mathbf{R}_n\}) \approx E_0 + \frac{1}{2} \sum_{n\alpha, m\beta} \Phi_{n\alpha, m\beta} \, u_{n\alpha} u_{m\beta} , \qquad (58)$$

where $u_{n\alpha}$ is the $\alpha$ component of the small displacement $\mathbf{R}_n - \mathbf{R}_n(0)$, with $\{\mathbf{R}_n(0)\}$ the
equilibrium ionic positions at the minimum energy $E_0$. The force functions $\Phi_{n\alpha, m\beta}$ are the second
derivatives of $E_g$ with respect to the displacements, evaluated at the ion equilibrium positions.

The force functions are directly related to the elastic constants. For example, for a Bravais lattice
they are given by Ashcroft and Mermin (1976) as

Cc^— ,  (59) 

where  the  sum  is  over  all  the  lattice  vectors  L. 

Eq. (57) is computationally the most difficult part of the problem, because it deals with
about $10^{23}$ electrons that are interacting with each other and with the ions; the wave functions
have to be the properly antisymmetrized many-body functions. Self-consistent
density-functional theory (SCDFT) (Hohenberg and Kohn, 1964; Kohn and Sham, 1965; Callaway and
March, 1984), which casts the Hamiltonian into a density functional, reduces the many-body
problem to an effective one-electron problem. This theory has been tested in many crystalline
solids and the resulting elastic constants have been excellent, especially for semiconductors.

In SCDFT, the ground state energy of a solid is completely specified by single-particle
wave functions of the occupied states $\{\phi_\nu\}$. First the electron density is given by

$$\rho(\mathbf{r}) = \sum_\nu |\phi_\nu(\mathbf{r})|^2 . \qquad (60)$$

Then the ground state energy is computed as follows:

$$E_g = K_e + U_{ee} + U_{ei} + U_{ii} + U_{xc} , \qquad (61)$$


where  the  different  terms  are  given  by  the  expressions 

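$$K_e = \sum_\nu \int \phi_\nu^*(\mathbf{r}) \left( -\frac{\hbar^2}{2m} \nabla^2 \right) \phi_\nu(\mathbf{r}) \, d^3r , \qquad (62)$$

$$U_{ee} = \frac{e^2}{2} \iint \frac{\rho(\mathbf{r}) \rho(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|} \, d^3r \, d^3r' , \qquad (63)$$

$$U_{ei} = -e^2 \sum_n Z_n \int \frac{\rho(\mathbf{r})}{|\mathbf{r} - \mathbf{R}_n|} \, d^3r , \qquad (64)$$

$$U_{ii} = \frac{e^2}{2} \sum_{m,n} \frac{Z_m Z_n}{|\mathbf{R}_m - \mathbf{R}_n|} , \qquad (65)$$

$$U_{xc} = \int \rho(\mathbf{r}) \, \epsilon_{xc}[\rho(\mathbf{r})] \, d^3r . \qquad (66)$$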

Note that $n = m$ is excluded in the sum in Eq. (65). The meaning and the notations of the above
equations are mostly self-evident, except the $\epsilon_{xc}[\rho]$ in Eq. (66); this is the correction term arising
from the many-body exchange and correlation effects. The square bracket means $\epsilon_{xc}$ is a
functional of $\rho$. Several different expressions for $\epsilon_{xc}$ as a function of $\rho$, available in the literature (for
example, Wigner, 1934; Hedin and Lundqvist, 1971; Ceperley and Alder, 1980; Perdew and
Zunger, 1981), have yielded similar results for the structural properties of semiconductors. A
minimization of $E_g$ with respect to $\phi_\nu$, with the constraint that the total number of electrons is a
constant, leads to the familiar single-particle Schrödinger equation:


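$$\left[ -\frac{\hbar^2}{2m} \nabla^2 + V(\mathbf{r}) \right] \phi_\nu(\mathbf{r}) = \epsilon_\nu \phi_\nu(\mathbf{r}) , \qquad (67)$$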


where $\epsilon_\nu$ is a Lagrange multiplier and $V$ is an effective one-electron potential containing
three parts:


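$$V = V_{ee} + V_{ei} + V_{xc} . \qquad (68)$$

$V_{ee}$ is the Coulomb potential due to the electron density,

$$V_{ee} = e^2 \int \frac{\rho(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|} \, d^3r' , \qquad (69)$$

$V_{ei}$ is the Coulomb potential due to ionic charges $Z_n$,

$$V_{ei} = -e^2 \sum_n \frac{Z_n}{|\mathbf{r} - \mathbf{R}_n|} . \qquad (70)$$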
Finally, the exchange-correlation potential $V_{xc}$ is given by

$$V_{xc} = \epsilon_{xc} + \rho \, \frac{d\epsilon_{xc}}{d\rho} . \qquad (71)$$

Thus Eqs. (60) and (67) through (71) form a repeated loop, $\rho \rightarrow V \rightarrow \phi \rightarrow \rho$, and the
calculation must be iterated until self-consistency is achieved. Following this recipe for calculating
the total energy, one calculates $E_g$ as a function of ionic positions, finds the equilibrium
configuration, then imposes a strain and calculates the strain energy to deduce the elastic constants,
following the prescription of Section 4. The problem then becomes strictly computational. The
most challenging task is an accurate solution for the single-particle eigenstates of Eq. (67). For
a crystalline solid, lattice translational symmetry simplifies the problem, and band-structure
techniques can be applied to obtain the solution. Because the strain energy is many orders of
magnitude smaller than the total energy, very precise computation is required if one hopes to
obtain meaningful elastic constants. So far at least two band-structure methods have been
demonstrated as reliable for all three elastic constants: the plane-wave method using
pseudopotentials (PP-PW) (Nielsen and Martin, 1983, 1985a), and the full-potential
linearized-muffin-tin-orbital method (FP-LMTO) (Methfessel et al., 1989). Although the full-potential augmented
plane-wave method (FP-APW) (Krakauer et al., 1979; Wimmer et al., 1981; Wei and Krakauer,
1985; Ferreira et al., 1989) has produced excellent lattice constants, structural energies, and bulk
moduli, the complete semiconductor elastic constants based on this method are not yet available.
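
The precision requirement can be made concrete with a small Python sketch. It assumes a purely quadratic model energy $E(e) = E_0 + \frac{1}{2} C V e^2$ and extracts $C$ with a central finite difference; the function names and all numbers are hypothetical and are not results from any of the methods cited above.

```python
def model_energy(strain, e0=-1000.0, c=50.0, volume=1.0):
    """Toy total energy: a large constant plus a small harmonic strain energy."""
    return e0 + 0.5 * c * volume * strain ** 2

def elastic_constant(e_minus, e_zero, e_plus, strain, volume):
    """Central second difference: C = [E(+e) - 2 E(0) + E(-e)] / (V e^2)."""
    return (e_plus - 2.0 * e_zero + e_minus) / (volume * strain ** 2)

if __name__ == "__main__":
    s = 0.01
    em, e0, ep = model_energy(-s), model_energy(0.0), model_energy(s)
    print("strain energy / total energy:", (ep - e0) / abs(e0))
    print("C from exact energies:", elastic_constant(em, e0, ep, s, 1.0))
    # An error of one part in 10^9 of the total energy at a single strain point
    # already shifts the fitted constant by 0.01 in the same units.
    print("C with 1e-6 error in E(+e):", elastic_constant(em, e0, ep + 1.0e-6, s, 1.0))
```

In this toy case the strain energy is a few parts per million of the total energy, so any numerical noise in the total energy is amplified by the factor $1/(V e^2)$ in the fitted constant.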

Even if the total energies at different distortions can be calculated accurately, there is still
the problem of searching for the equilibrium atomic positions in a distorted crystal, and the
numerical determination of elastic constants from energy differences. If the strain energy can be
calculated directly without taking the difference between two large energies, or if the derivatives
can be calculated directly, not only can the computation time be shortened but the numerical
errors will also be reduced. The quantum mechanical theory of forces and stresses of Nielsen
and Martin (1985b), and the closely related direct calculation of elastic constants from linear
response theory of Baroni et al. (1987) represent the status of efforts in this direction. The
former has been carried out for all three elastic constants for Si, Ge, and GaAs (Nielsen and
Martin, 1985a), while the latter has been done only on the bulk modulus for Si; both are based on
the PP-PW method. Table 1 shows a comparison between theoretical calculations and
experimental results. From this comparison, it is fair to say that we have a very reliable ab initio theory
for the elastic constants for crystalline semiconductors based on the self-consistent local
density-functional theory. Note that Table 1 also lists the results from an empirical tight-binding (TB)
theory to be discussed in Section 10.

9.  VALENCE  FORCE  FIELD  MODEL 

The  preceding  section  showed  that  ab  initio  theory  for  the  elastic  constants  requires 
complicated  computations.  Accurate  ab  initio  calculations  for  semiconductors  have  been 
obtained  only  recently  and  only  for  several  systems.  On  the  other  hand,  phenomenological 
microscopic  models  of  elastic  constants  for  all  semiconductors  have  been  available  for  some 
time.  Of  these,  the  valence  force-field  model  (VFF)  is  perhaps  the  simplest  and  the  most  useful. 
This  topic  has  been  reviewed  and  well  analyzed  in  a  paper  by  Martin  (1970),  and  its  conclusions 
constitute  the  main  body  of  this  section. 


a.  Diamond  Structure 


The original VFF model by Musgrave and Pople (1962) was for the diamond structure, in
which the elastic energy is a quadratic form in terms of the changes in each bond length $\Delta r_i$, in
bond angles $\Delta\theta_{ij}$, and in the products $\Delta r_i \, \Delta r_j$ and $\Delta\theta_{ij}$, between nearest-neighbor bonds. For
elastic constants, Keating (1966) showed that the VFF can be simplified by the following
approximation to the elastic energy of the crystal:


(72) 
where the bond index $i$ runs over all the bonds, but the $i$ and $j$ sum only over those pairs of bonds
that are connected to a common atom. In Eq. (72), $d$ is the equilibrium bond length and $\Delta(\mathbf{r}_i \cdot \mathbf{r}_j)$
is the change in the dot product of the two bond vectors which start at the common atom, point
along the bond directions, and end at the first-neighbor atoms. Following the calculational
procedure described in Section 2 for uniform expansion, the $U$ in the VFF under a uniform
expansion can be shown to be $U = \frac{3}{2}(3\alpha + \beta) e^2 / a$, where $\Omega = a^3/4$ is the equilibrium volume
per unit cell. Thus

$$B = (C_{11} + 2C_{12})/3 = (\alpha + \beta/3)/a . \qquad (73)$$

For the shear strain described by Eq. (9), Eq. (72) yields $U = 4\beta e^2/a$. Thus, according to Eq. (10),

$$C_{11} - C_{12} = 4\beta/a . \qquad (74)$$

For the shear strain described by Eq. (11) and with an internal displacement $\mathbf{u} = (0, 0, u)$, Eq. (72)
yields the following expression,

$$U = [\alpha (e - \eta)^2 + \beta (e + \eta)^2]/(8a) , \qquad (75)$$

where $u = \eta a/4$. A comparison between Eqs. (13) and (75) shows that $\Phi = 16(\alpha + \beta)/a^3$,
$D = 4(\alpha - \beta)/a^2$, and $C_{44}^0 = (\alpha + \beta)/a$. Using these results in Eq. (14), we find that the Kleinman
internal displacement parameter in the present model is given by

$$\zeta = (\alpha - \beta)/(\alpha + \beta) = 2C_{12}/(C_{11} + C_{12}) . \qquad (76)$$

Eq. (15) then produces

$$C_{44} = 4\alpha\beta/[(\alpha + \beta) a] . \qquad (77)$$

The three elastic constants given above are not independent, and can be shown to relate to each
other by the Keating Identity (1966), or

$$I_K = 2C_{44}(C_{11} + C_{12})/[(C_{11} - C_{12})(C_{11} + 3C_{12})] = 1 . \qquad (78)$$
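
A short Python check of the Keating relations follows. It takes $C_{11} = (\alpha + 3\beta)/a$ and $C_{12} = (\alpha - \beta)/a$, which follow from Eqs. (73) and (74), and uses $C_{44} = 4\alpha\beta/[(\alpha + \beta)a]$, the form for which the identity of Eq. (78) holds exactly. The function names are hypothetical, and the force constants in the example are round illustrative numbers of the magnitude appropriate to silicon, not values taken from Table 3.

```python
def keating_constants(alpha, beta, a):
    """Elastic constants of the two-parameter Keating model (alpha, beta in N/m, a in m)."""
    c11 = (alpha + 3.0 * beta) / a
    c12 = (alpha - beta) / a
    c44 = 4.0 * alpha * beta / ((alpha + beta) * a)
    return c11, c12, c44

def keating_identity(c11, c12, c44):
    """I_K of Eq. (78); equal to 1 when the Keating model holds exactly."""
    return 2.0 * c44 * (c11 + c12) / ((c11 - c12) * (c11 + 3.0 * c12))

def kleinman_zeta(c11, c12):
    """Internal displacement parameter from the elastic constants, Eq. (76)."""
    return 2.0 * c12 / (c11 + c12)

if __name__ == "__main__":
    alpha, beta, a = 48.5, 13.8, 5.43e-10   # illustrative values, SI units
    c11, c12, c44 = keating_constants(alpha, beta, a)
    print("C11, C12, C44 (GPa):", c11 / 1e9, c12 / 1e9, c44 / 1e9)
    print("I_K:", keating_identity(c11, c12, c44))
    print("zeta:", kleinman_zeta(c11, c12))
```

Applied to measured elastic constants rather than model ones, the same `keating_identity` function gives the deviation of $I_K$ from unity for a given material.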
b. Zincblende Structure and Coulomb Force

The Keating Identity (1966) holds very well for systems with the diamond structure but
not so well for the zincblende compounds (see Table 3). One obvious difference between the
two structures is the presence of Coulomb interactions arising from charge shifts between the
cation and anion sublattices in zincblende (zb) semiconductors. Martin (1970) incorporated Blackman's
(1959) treatment of the Coulomb forces in the Keating VFF in the following manner. First, the
Coulomb energy was treated as a screened Madelung energy $E_M$. For example, in a uniformly
expanded crystal with a bond length $r$, the Coulomb energy was taken to be
$E_M = -N \alpha_M Z^{*2} e^2/(\epsilon r)$, where $N$ is the total number of unit cells, $\alpha_M = 1.6381$ is the Madelung
constant, and $Z^{*2}/\epsilon$ is the effective charge defined by the optic-mode splitting:

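$$S = \frac{Z^{*2}}{\epsilon} = \frac{\Omega \mu (\omega_l^2 - \omega_t^2)}{4 \pi e^2} . \qquad (79)$$

In Eq. (79), $\omega_l$ and $\omega_t$ are respectively the longitudinal and transverse optical phonon frequencies in the
long-wavelength limit, and $\mu$ is the reduced mass of the two atoms in the unit cell. Then, to
counterbalance the Coulomb forces, a repulsive force term was
added and assumed to contribute to the bond-stretching energy in the form

$$\Delta E_R = -\sum_i \alpha_M Z^{*2} e^2 \, \Delta r_i / (4 \epsilon d^2) . \qquad (80)$$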

With the above two contributions added, the total strain energy is $\Delta E_T = \Delta E + \Delta E_M + \Delta E_R$,
where $\Delta E$ is the VFF contribution in Eq. (72) and $\Delta E_M$ is the change in the Madelung energy.
The energies $\Delta E_T$ are expanded in a power series, and only terms up to the second power in the
strain are kept. The $\Delta E_M$ contributions arising from fixed values of the charge shift $S = Z^{*2}/\epsilon$
on the atomic sites under different strains were worked out by Blackman (1959). Using these
results and defining

$$s = (e^2/d^4) S = e^2 Z^{*2}/(d^4 \epsilon) , \qquad (81)$$
Martin (1970) obtained the following modified expressions for the elastic constants:

$$C_{11} + 2C_{12} = (3\alpha + \beta)/a - 0.355 s , \qquad (82)$$

$$C_{11} - C_{12} = 4\beta/a + 0.053 s , \qquad (83)$$

$$\zeta = [(\alpha - \beta)/a - 0.294 s]/C_M , \qquad (84)$$

$$C_{44} = (\alpha + \beta)/a - 0.136 s - C_M \zeta^2 , \qquad (85)$$

where $C_M$ is defined as

$$C_M = (\alpha + \beta)/a - 0.266 s . \qquad (86)$$

The above equations can be combined to yield

$$\zeta = (2C_{12} - C')/(C_{11} + C_{12} - C') , \qquad (87)$$

where $C' = 0.314 s$. Since the extra parameter $s$ is fixed by the optic modes and the bond length,
the above results combine into a new identity, the Martin Identity (1970):

$$I_M = \frac{2C_{44}(C_{11} + C_{12} - C')}{(C_{11} - C_{12})(C_{11} + 3C_{12} - 2C') + 0.831 C' (C_{11} + C_{12})} = 1 . \qquad (88)$$
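
The Coulomb-corrected relations can be exercised numerically in the same way. The Python sketch below builds the elastic constants from Eqs. (82) through (86) and evaluates Eqs. (87) and (88). The function names and the input numbers are hypothetical: $\alpha/a$, $\beta/a$, and $s$ are given directly in arbitrary pressure units. Because the numerical coefficients are rounded and terms of second order in $s$ are dropped, $I_M$ is expected to be close to, not exactly, unity when $s$ is nonzero.

```python
def martin_constants(alpha_over_a, beta_over_a, s):
    """Elastic constants and zeta from Eqs. (82)-(86); all inputs in the same pressure unit."""
    total = 3.0 * alpha_over_a + beta_over_a - 0.355 * s      # C11 + 2 C12, Eq. (82)
    diff = 4.0 * beta_over_a + 0.053 * s                      # C11 - C12,   Eq. (83)
    c12 = (total - diff) / 3.0
    c11 = c12 + diff
    c_m = alpha_over_a + beta_over_a - 0.266 * s              # Eq. (86)
    zeta = (alpha_over_a - beta_over_a - 0.294 * s) / c_m     # Eq. (84)
    c44 = alpha_over_a + beta_over_a - 0.136 * s - c_m * zeta ** 2  # Eq. (85)
    return c11, c12, c44, zeta

def martin_identity(c11, c12, c44, s):
    """I_M of Eq. (88) with C' = 0.314 s."""
    c_prime = 0.314 * s
    numerator = 2.0 * c44 * (c11 + c12 - c_prime)
    denominator = ((c11 - c12) * (c11 + 3.0 * c12 - 2.0 * c_prime)
                   + 0.831 * c_prime * (c11 + c12))
    return numerator / denominator

def zeta_from_constants(c11, c12, s):
    """Internal displacement parameter from Eq. (87)."""
    c_prime = 0.314 * s
    return (2.0 * c12 - c_prime) / (c11 + c12 - c_prime)

if __name__ == "__main__":
    c11, c12, c44, zeta = martin_constants(100.0, 25.0, 30.0)
    print("C11, C12, C44:", c11, c12, c44)
    print("zeta from Eq. (84):", zeta, " from Eq. (87):", zeta_from_constants(c11, c12, 30.0))
    print("I_M:", martin_identity(c11, c12, c44, 30.0))
```

Setting `s` to zero reduces every expression to its Keating counterpart, so `martin_identity` then returns 1 to rounding error.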

Table 3 lists a set of experimental values of the elastic constants and the $s$ values for a number of
diamond and zincblende semiconductors. These values are used to compute the force constants
$\alpha$ and $\beta$ and the identity expressions $I_K$ and $I_M$ given in Eqs. (78) and (88), respectively. Several
sets of data are quoted for some of the systems to show the uncertainties in the experiments for
these systems. The table results clearly show that the inclusion of the Coulomb energies
improves the identity relation; the deviations of $I_M$ from unity are 15% or less. Also listed are
the values for another identity relation, $I_{BOM}$ from Eq. (119), based on a tight-binding model to
be discussed in Section 10. Martin further studied trends as functions of the bond lengths $d$ and
the ionicity scale $f_i$ of Phillips and Van Vechten (1969). He found that $\alpha$ scales roughly as $1/d^3$,
i.e.,

$$\alpha d^3 / e^2 = \text{constant} , \qquad (89)$$
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where  e  is  the  electron  charge.  He  also  found  the  ratio  between  the  bond-angle  and  bond¬ 
stretching  forces  tend  to  decrease  as  fi  increases  and  scales  roughly  as 

P/a-ci_fi  (90) 

He  further  observed  that  if  S  of  Eq.  (79)  is  set  equal  to  fi  and  if  the  a  and  P  values  are  extrapo¬ 
lated,  using  Eqs.  (89)  and  (90),  from  those  fitted  to  the  average  values  of  the  B  and  Ci  i  -  Ci2  for 
Si  and  Ge,  then  aU  the  elastic  constants  can  be  predicted  from  Eqs.  (82)  through  (86)  to  an  accu¬ 
racy  of  10%. 

It is interesting to compare Eq. (82), using the results of Eqs. (89) and (90), with Cohen's
(1985) empirical formula for the bulk modulus

$$ B = (1971 - 220\lambda)/d^{3.5} , \qquad (91) $$

where B is in GPa, d in Å, and $\lambda$ = 0, 1, and 2 respectively for the group IV, III-V, and
II-VI semiconductors. Both Martin's and Cohen's formulas give B values to better than 10% for all
materials tabulated in Table 3. The B in Eq. (91) scales as $1/d^{3.5}$, while in VFF it scales
as $1/d^4$.
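
Cohen's formula is simple enough to evaluate directly. The sketch below, with hypothetical function names, computes Eq. (91) and estimates the scaling exponent numerically; the bond length used in the example (2.35 Å, roughly that of Si) is an illustrative input rather than a value taken from Table 3.

```python
import math


def cohen_bulk_modulus(d_angstrom, lam):
    """Eq. (91): bulk modulus in GPa for bond length d in Angstrom.

    lam is 0, 1, or 2 for group IV, III-V, and II-VI semiconductors.
    """
    if lam not in (0, 1, 2):
        raise ValueError("lam must be 0, 1, or 2")
    return (1971.0 - 220.0 * lam) / d_angstrom ** 3.5


def scaling_exponent(d_angstrom, lam, rel_step=1e-4):
    """Numerical estimate of -d(ln B)/d(ln d) for Eq. (91)."""
    b_plus = cohen_bulk_modulus(d_angstrom * (1.0 + rel_step), lam)
    b_minus = cohen_bulk_modulus(d_angstrom * (1.0 - rel_step), lam)
    dlnb = math.log(b_plus) - math.log(b_minus)
    dlnd = math.log(1.0 + rel_step) - math.log(1.0 - rel_step)
    return -dlnb / dlnd


if __name__ == "__main__":
    print(cohen_bulk_modulus(2.35, 0))   # group IV example
    print(scaling_exponent(2.35, 0))     # recovers the 3.5 power law
```

For d = 2.35 Å and $\lambda$ = 0 the formula gives about 99 GPa.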

10.  TIGHT  BINDING  THEORY 

In the semi-empirical tight-binding (TB) approach, the total energy of a semiconductor crystal is
assumed to be the sum of the electron energies $\epsilon_v(\mathbf{k})$ in the valence bands plus
repulsive pair energies $u_{ij}$ between the nearest-neighbor atoms (Chadi, 1978):

$$ E_T = E_{bs} + U_r = \sum_{v}\sum_{\mathbf{k}} \epsilon_v(\mathbf{k}) + \sum_{i>j} u_{ij} . \qquad (92) $$

Furthermore, the band energies are interpolated by using a TB Hamiltonian which contains the
term values of the atoms and a handful of interaction parameters between orbitals of the
neighboring atoms. Despite the simplicity of Eq. (92), recent first-principles theories have given
some support to this approximation.

One virtue of the TB approach over valence-force-field models is that it is a quantum theory
without much complication. Compared to first-principles theory, the TB approach is easier to
execute, particularly when applied to complicated systems such as alloys and superlattices. In
actual applications, the TB calculation either is carried out using the full band-structure
calculation (BS), or is approximated by a simpler local theory such as Harrison's (1980, 1983a
and 1983b) bond orbital model (BOM).

a.  Band  Structure  Calculations 

The simplest TB Hamiltonian contains the s and p atomic term values $\epsilon_s$ and $\epsilon_p$
for both cations and anions, and the nearest-neighbor two-center interactions $V_{ss\sigma}$,
$V_{sp\sigma}$, $V_{pp\sigma}$ and $V_{pp\pi}$. To be more explicit, the 8 x 8 k-dependent
Hamiltonian contains the term values as the diagonal matrix elements, while the off-diagonal
matrix elements between the cation and anion orbitals are given by

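$$ H_{\gamma\gamma'}(\mathbf{k}) = \sum_{\mathbf{d}} e^{i\mathbf{k}\cdot\mathbf{d}}\, h_{\gamma\gamma'}(\mathbf{d}) , \qquad (93) $$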

where the sum runs over the four first-neighbor atoms specified by the bond displacements d.
The $\gamma$'s are the orbital indices for s, $p_x$, $p_y$ and $p_z$. The $h_{\gamma\gamma'}$
values are related to the two-center V's by the Slater-Koster (1954) relations:

$$ h_{ss} = V_{ss\sigma} , \qquad (94) $$

$$ h_{sx} = \alpha_1 V_{sp\sigma} , \qquad (95) $$

$$ h_{xx} = \alpha_1^2 V_{pp\sigma} + (1 - \alpha_1^2) V_{pp\pi} , \qquad (96) $$

$$ h_{xy} = \alpha_1 \alpha_2 (V_{pp\sigma} - V_{pp\pi}) , \qquad (97) $$

where $\alpha_i = x_i/d$ are the direction cosines of d and the V's depend only on the length d.
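
The Slater-Koster relations of Eqs. (94) through (97) can be sketched as a small routine that fills the 4 x 4 table of two-center matrix elements for one bond. This is a minimal illustration, not the report's code: the function name is hypothetical, a single $V_{sp\sigma}$ is assumed for both atom orderings (in a polar crystal the two can differ), and the sign flip for the p-s element is the standard Slater-Koster convention rather than something stated in the text.

```python
import math


def slater_koster_sp(bond, v_ss_sigma, v_sp_sigma, v_pp_sigma, v_pp_pi):
    """Return h[g][g2] for orbitals (s, px, py, pz), following Eqs. (94)-(97)."""
    length = math.sqrt(sum(x * x for x in bond))
    cos = [x / length for x in bond]            # direction cosines alpha_i = x_i / d
    h = [[0.0] * 4 for _ in range(4)]
    h[0][0] = v_ss_sigma                        # Eq. (94)
    for i in range(3):
        h[0][i + 1] = cos[i] * v_sp_sigma       # Eq. (95): s on first atom, p on second
        h[i + 1][0] = -cos[i] * v_sp_sigma      # p on first atom, s on second (odd parity)
        for j in range(3):
            if i == j:                          # Eq. (96)
                h[i + 1][j + 1] = (cos[i] ** 2 * v_pp_sigma
                                   + (1.0 - cos[i] ** 2) * v_pp_pi)
            else:                               # Eq. (97)
                h[i + 1][j + 1] = cos[i] * cos[j] * (v_pp_sigma - v_pp_pi)
    return h


if __name__ == "__main__":
    # Bond along [111] with made-up interaction strengths.
    for row in slater_koster_sp((1.0, 1.0, 1.0), -1.0, 2.0, 3.0, -0.5):
        print(row)
```

Multiplying each table by the phase factor for its bond and summing over the four bonds would give the off-diagonal block of the k-dependent Hamiltonian.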
Once the values of these TB parameters and their dependences on the bond length are known, the
Hamiltonian at each k inside the Brillouin zone (BZ) can be evaluated, and the summation over k
carried out to obtain the band-structure energy, which, when added to the repulsive energy, gives
the total energy of any specified geometry. All the elastic constants, the associated internal
displacements, and the transverse optical phonon frequencies are then readily calculable
following the procedure of Section 2. The only point to note is the k-sum, which without strain
can be calculated accurately by using the ten special k points (Chadi and Cohen, 1973) in the
irreducible wedge of the BZ. Under strain, the crystal symmetry is lowered, and one needs to
extend these special k points to other nonequivalent wedges. However, since the sum of the
valence-band energies is a rather smooth function of k, a uniform sampling over the whole BZ
converges very quickly. A 5 x 5 x 5 grid is sufficiently accurate for the required calculations.
To avoid the numerical inaccuracy inherent in direct energy subtractions, one can also calculate
the second derivatives directly by using perturbation theory.

Perturbation theory starts with the expansion of the k-dependent Hamiltonian H in powers of the
infinitesimal strain parameter e, keeping terms up to second order,

$$ H(\mathbf{k}) = H_0 + H_1 e + \tfrac{1}{2} H_2 e^2 , \qquad (98) $$

where $H_0$ is the strain-free Hamiltonian, and $H_1$ and $H_2$ are respectively the first and
second derivatives with respect to e evaluated at e = 0. The band energy contribution to the
strain coefficient then comes from the second derivative of $E_{bs}$ with respect to e, given by

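$$ \frac{d^2 E_{bs}}{de^2} = \sum_{v}\sum_{\mathbf{k}} \langle v\mathbf{k}|H_2|v\mathbf{k}\rangle + 2\sum_{v}\sum_{c}\sum_{\mathbf{k}} \frac{|\langle c\mathbf{k}|H_1|v\mathbf{k}\rangle|^2}{\epsilon_v(\mathbf{k}) - \epsilon_c(\mathbf{k})} , \qquad (99) $$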

where $\epsilon_c(\mathbf{k})$ and $|c\mathbf{k}\rangle$ are respectively the eigenenergies and
eigenvectors of $H_0$ for the conduction bands, and $v\mathbf{k}$ stands for the valence bands.
Note that the inter-valence-band contributions in the second-order perturbation sum cancel
exactly, so they are not needed in Eq. (99). To evaluate these matrix elements one needs the
first and second strain derivatives of the two-center interactions and the direction cosines
$\alpha_i$. For the strain parameters e defined in Section 2, and for two-center interactions V
that scale as $1/d^n$, the following results are useful.

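(i) For the bulk modulus, the direction cosines do not change, and we have

$\partial V/\partial e = -nV$, and $\partial^2 V/\partial e^2 = n(n+1)V$.

(ii) For $C_{11} - C_{12}$ with e specified in Eq. (9), we get

$\partial\alpha_i/\partial e = \alpha_i(\delta_{i1} - \delta_{i2})$,
$\partial^2\alpha_i/\partial e^2 = \alpha_i[3(\delta_{i1} + \delta_{i2}) - 4]/3$,
$\partial V/\partial e = 0$, and $\partial^2 V/\partial e^2 = -4nV/3$.

(iii) For $C_{44}$ with the strain e given in Eq. (11) and an internal displacement u, we find

$\partial V/\partial e = -n\alpha_1\alpha_2 V$, $\partial V/\partial u = n\alpha_3 V/d$,
$\partial\alpha_i/\partial e = (\delta_{i1}\alpha_2 + \delta_{i2}\alpha_1)/2 - \alpha_i\alpha_1\alpha_2$,
$\partial\alpha_i/\partial u = -\delta_{i3}/d + \alpha_i\alpha_3/d$,
$\partial^2 V/\partial e^2 = n(n-1)V/9$, $\partial^2 V/\partial u^2 = n(n-1)V/(3d^2)$,
$\partial^2\alpha_i/\partial e^2 = -\alpha_i(\delta_{i1} + \delta_{i2})/12$,
$\partial^2\alpha_i/\partial u^2 = -2\delta_{i3}\alpha_3/d^2$,
$\partial^2 V/(\partial e\,\partial u) = -n(n+2)\alpha_1\alpha_2\alpha_3 V/d$, and finally
$\partial^2\alpha_i/(\partial e\,\partial u) = (\delta_{i1}\alpha_2\alpha_3 + \delta_{i2}\alpha_1\alpha_3 + 2\delta_{i3}\alpha_1\alpha_2 - 6\alpha_i\alpha_1\alpha_2\alpha_3)/(2d)$.

Also note that three second derivatives of the band-structure energy are needed, namely
$\partial^2 E_{bs}/\partial e^2$, $\partial^2 E_{bs}/(\partial e\,\partial u)$, and
$\partial^2 E_{bs}/\partial u^2$; these enter the coefficients of Eq. (13), including D and $\Phi$.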
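
The derivative formulas in (iii) can be checked by finite differences. The sketch below does this for the [111] bond, in units of the unstrained bond length. It rests on two assumptions that are not spelled out in this section: the shear of Eq. (11) is taken to act as the finite deformation x -> x cosh(e/2) + y sinh(e/2), y -> y cosh(e/2) + x sinh(e/2), which is the form that reproduces the quoted coefficients, and the internal displacement u is taken to shorten the z component of the bond. All function names are hypothetical.

```python
import math


def bond_after_shear(e, u_over_d=0.0):
    """[111] bond vector, in units of d, after the assumed shear e and displacement u."""
    a = 1.0 / math.sqrt(3.0)                      # unstrained direction cosine
    c, s = math.cosh(e / 2.0), math.sinh(e / 2.0)
    return (a * c + a * s, a * c + a * s, a - u_over_d)


def two_center(e, n, u_over_d=0.0):
    """V/V0 = (d0/d)^n for the deformed bond."""
    x, y, z = bond_after_shear(e, u_over_d)
    return (x * x + y * y + z * z) ** (-n / 2.0)


def second_derivative(f, h=1e-3):
    """Central-difference second derivative of f at zero."""
    return (f(h) - 2.0 * f(0.0) + f(-h)) / (h * h)


if __name__ == "__main__":
    n = 2.0
    print(second_derivative(lambda e: two_center(e, n)), n * (n - 1.0) / 9.0)
    print(second_derivative(lambda u: two_center(0.0, n, u)), n * (n - 1.0) / 3.0)
```

Each printed pair compares the numerical second derivative with the corresponding analytic expression, n(n - 1)V/9 for the strain and n(n - 1)V/(3d^2) for the internal displacement.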


b. Bond Orbital Model

Harrison's (1980, 1983a and 1983b) bond orbital model (BOM) emphasizes calculations of the TB
total energy in terms of local energies. One special feature of the BOM is its universality.
Another feature is that its simple, and often analytical, forms provide direct insight into the
essential physics. Although the BOM aims at predicting trends, it is reasonably accurate in many
cases.

The band-structure energy, or the center of gravity of the valence band, in BOM is computed in
the following steps. The terms involved are indicated in Figure 2.

(1) Construct the $sp^3$ hybrid orbitals $|h\rangle$ for each atom; these hybrid orbitals are
directed toward the neighboring atoms. For example, the hybrid in the [111] direction is given by

$$ |h\rangle = (|s\rangle + |p_x\rangle + |p_y\rangle + |p_z\rangle)/2 . \qquad (100) $$

The hybrid energy is then given by $\epsilon_h = \langle h|H|h\rangle = (\epsilon_s + 3\epsilon_p)/4$.
In the zinc-blende structure, the cation hybrid energy $\epsilon_h^C$ is in general different
from the anion hybrid energy $\epsilon_h^A$. Note that two hybrid orbitals of the same atom but
in two different directions are now coupled by the so-called metallic energy
$V_1 = \langle h|H|h'\rangle = (\epsilon_s - \epsilon_p)/4$.

(2) Construct the bonding and antibonding molecular orbitals, $|b\rangle$ and $|a\rangle$, from
the two hybrid orbitals $|h^A\rangle$ and $|h^C\rangle$ directed toward each other along the same
bond by diagonalizing a 2 x 2 matrix with $\epsilon_h^A$ and $\epsilon_h^C$ on the diagonal, and
$V_2 = \langle h^A|H|h^C\rangle$ as the off-diagonal matrix elements. The resulting energies for
$|b\rangle$ and $|a\rangle$ are $\epsilon_b = \bar\epsilon_h - (V_2^2 + V_3^2)^{1/2}$ and
$\epsilon_a = \bar\epsilon_h + (V_2^2 + V_3^2)^{1/2}$ respectively, where
$\bar\epsilon_h = (\epsilon_h^C + \epsilon_h^A)/2$ is the mean hybrid energy, $V_2$ is the
covalent energy, and $V_3 = (\epsilon_h^C - \epsilon_h^A)/2$ is the polar energy. The eigenstates
can also be written explicitly in terms of these energies:

$$ |b\rangle = [(1 + \alpha_p)/2]^{1/2}\,|h^A\rangle + [(1 - \alpha_p)/2]^{1/2}\,|h^C\rangle , \qquad (101) $$

$$ |a\rangle = -[(1 - \alpha_p)/2]^{1/2}\,|h^A\rangle + [(1 + \alpha_p)/2]^{1/2}\,|h^C\rangle , \qquad (102) $$

where $\alpha_p$ is called the polarity and is defined as

$$ \alpha_p = V_3/(V_2^2 + V_3^2)^{1/2} . \qquad (103) $$

(3) The quantity $\epsilon_b$ would be the center of gravity of the valence bands, if
interactions between states on different bonds were neglected. Harrison (1983b) incorporated
these interactions in a perturbation theory in which the change of the bonding energy is given by

$$ \Delta\epsilon_b = \sum_{a'} \frac{|\langle b|H|a'\rangle|^2}{\epsilon_b - \epsilon_{a'}} , \qquad (104) $$

where the sum runs over the antibonding states of the surrounding bonds. Note that the
interactions among the bonding states lead to the formation of the valence bands, but do not
shift their center of gravity; therefore these interactions need not be considered in the total
energy calculation. This simplification brings out the major physics of the tetrahedral bonding
of semiconductors. Including the energy correction $\Delta\epsilon_b$, the final band-structure
energy per bond (which contains two electrons) is given by

$$ E_b = 2\epsilon_b + 2\Delta\epsilon_b . \qquad (105) $$
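
Steps (1) and (2) above amount to a few lines of arithmetic, sketched here with hypothetical function names. The term values and the covalent energy passed in the example are made-up numbers in eV, chosen only to exercise the formulas; they are not entries from Table 4.

```python
import math


def hybrid_energy(eps_s, eps_p):
    """sp3 hybrid energy, (eps_s + 3 eps_p) / 4."""
    return (eps_s + 3.0 * eps_p) / 4.0


def metallic_energy(eps_s, eps_p):
    """Metallic energy V1 = (eps_s - eps_p) / 4 coupling two hybrids on one atom."""
    return (eps_s - eps_p) / 4.0


def bond_orbital(eps_h_anion, eps_h_cation, v2):
    """Diagonalize the 2 x 2 hybrid problem of step (2)."""
    mean = (eps_h_cation + eps_h_anion) / 2.0      # mean hybrid energy
    v3 = (eps_h_cation - eps_h_anion) / 2.0        # polar energy
    w = math.hypot(v2, v3)                         # (V2^2 + V3^2)^(1/2)
    alpha_p = v3 / w                               # polarity, Eq. (103)
    return {
        "eps_b": mean - w,                         # bonding level
        "eps_a": mean + w,                         # antibonding level
        "V3": v3,
        "alpha_p": alpha_p,
        "alpha_c": math.sqrt(1.0 - alpha_p ** 2),  # covalency
    }


if __name__ == "__main__":
    print(hybrid_energy(-14.0, -7.0), metallic_energy(-14.0, -7.0))
    print(bond_orbital(eps_h_anion=-10.0, eps_h_cation=-6.0, v2=-3.0))
```

The metallization sum of Eq. (104) is not included, since it depends on which couplings between neighboring bonds are retained.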

Harrison (1980) referred to the second term as the metallization energy. Besides providing a
simple means for evaluating the center of gravity of the valence bands, Harrison's BOM also
provides a set of universal TB parameters. Based on comparison with the free-electron band width
(Froyen and Harrison, 1979) and with empirical TB parameters, Harrison (1983b) deduced the
following set of universal two-center interactions:

$$ V_{\alpha\alpha'} = \eta_{\alpha\alpha'}\,\hbar^2/(m d^2) , \qquad (106) $$

with $\eta_{ss\sigma} = -1.32$, $\eta_{sp\sigma} = 1.42$, $\eta_{pp\sigma} = 2.22$ and
$\eta_{pp\pi} = -0.63$, where m is the free-electron mass and d the bond length. In units where d
is in Å and V in eV, $V_{\alpha\alpha'} = 7.62\,\eta_{\alpha\alpha'}/d^2$. The pair-repulsive
energy u in the BOM is taken as resulting from the overlap of the wavefunctions of the orbitals
on the two centers, and was shown to have the form

$$ u = u_0 (d_0/d)^4 , \qquad (107) $$

where $d_0$ is the equilibrium bond length. The value of $u_0$ is determined by requiring that
$d_0$ take the experimental value. Note that the d dependences of both $V_{\alpha\alpha'}$ and u
are taken to be the proper scaling not only among different systems but also within the same
system, as the bond length varies under distortions.
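
The universal parameters of Eq. (106) and the repulsion of Eq. (107) are easy to tabulate. The sketch below uses hypothetical names; the covalent energy it returns is the aligned-hybrid combination $(V_{ss\sigma} - 2\sqrt{3}V_{sp\sigma} - 3V_{pp\sigma})/4$, i.e. Eq. (112) at zero misalignment angle.

```python
import math

HBAR2_OVER_M = 7.62  # hbar^2 / m in eV * Angstrom^2, as quoted with Eq. (106)

ETA = {
    "ss_sigma": -1.32,
    "sp_sigma": 1.42,
    "pp_sigma": 2.22,
    "pp_pi": -0.63,
}


def universal_v(kind, d_angstrom):
    """Eq. (106): two-center interaction in eV for bond length d in Angstrom."""
    return ETA[kind] * HBAR2_OVER_M / d_angstrom ** 2


def covalent_energy(d_angstrom):
    """V2 between two aligned sp3 hybrids, built from the universal parameters."""
    v_ss = universal_v("ss_sigma", d_angstrom)
    v_sp = universal_v("sp_sigma", d_angstrom)
    v_pp = universal_v("pp_sigma", d_angstrom)
    return (v_ss - 2.0 * math.sqrt(3.0) * v_sp - 3.0 * v_pp) / 4.0


def pair_repulsion(u0, d0, d):
    """Eq. (107): u = u0 (d0/d)^4."""
    return u0 * (d0 / d) ** 4


if __name__ == "__main__":
    d = 2.35  # illustrative bond length in Angstrom
    print(universal_v("ss_sigma", d), covalent_energy(d))
```

In these units the covalent energy comes out as roughly -3.22 times $\hbar^2/(m d^2)$, so it inherits the same $1/d^2$ scaling as the individual interactions.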

Under these assumptions, the bond energy $E_{bond}$, which is defined as the difference between
the energy per bond in a semiconductor and the average energy per two electrons in the free
atoms, i.e., $E_{bond} = E_b + u_0 - 2\bar\epsilon$, with $\bar\epsilon$ the average energy per
electron in the free atoms, takes the following simple form for a nonpolar semiconductor
(Harrison 1983a):

$$ E_{bond} = V_2 (1 - \alpha_m + 9\alpha_m^2/16) , \qquad (108) $$

where $\alpha_m$ is called the metallicity and is defined as $2V_1/V_2$. For a polar
semiconductor, $E_{bond}$ becomes slightly more complicated,

$$ E_{bond} = 2\bar\epsilon_h - 2\bar\epsilon - 2(V_2^2 + V_3^2)^{1/2}\left[1 - \tfrac{1}{2}\alpha_c^2 + \tfrac{9}{16}\alpha_c^4 (V_{1c}^2 + V_{1a}^2)/(V_2^2 + V_3^2)\right] , \qquad (109) $$

where $\alpha_c = (1 - \alpha_p^2)^{1/2}$ is called the covalency, and $V_{1c}$ and $V_{1a}$ are
the metallic energies for the cation and the anion respectively. The bulk modulus also takes a
very simple form; for a group-IV semiconductor, it reads

$$ B = -2V_2 (1 - 9\alpha_m^2/16)/(\sqrt{3}\, d^3) , \qquad (110) $$

and for a polar semiconductor, it becomes

$$ B = -2V_2\left[\alpha_c^3 - \tfrac{9}{8}\alpha_c^3 (5\alpha_c^2 - 4)(V_{1c}^2 + V_{1a}^2)/(V_2^2 + V_3^2)\right]/(\sqrt{3}\, d^3) . \qquad (111) $$

These expressions show that the bulk modulus varies as $1/d^5$ in the pure covalent case, and as
$1/d^{11}$ in the extreme ionic limit $V_3 \gg V_2$. Note that this result is different from the
$1/d^{3.5}$ dependence in Cohen's (1985) formula and the $1/d^4$ scaling in VFF.
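
A short sketch of the two bulk-modulus expressions, Eqs. (110) and (111), is given here as a consistency check: with $V_3 = 0$ and equal metallic energies the polar form should collapse to the nonpolar one. The function names, the unit-conversion constant, and the input energies are assumptions for illustration; energies are in eV and lengths in Å.

```python
import math

EV_PER_A3_TO_GPA = 160.2177  # 1 eV/Angstrom^3 expressed in GPa


def bulk_modulus_nonpolar(v2, v1, d):
    """Eq. (110), returned in eV/Angstrom^3."""
    alpha_m = 2.0 * v1 / v2                        # metallicity
    return -2.0 * v2 * (1.0 - 9.0 * alpha_m ** 2 / 16.0) / (math.sqrt(3.0) * d ** 3)


def bulk_modulus_polar(v2, v3, v1c, v1a, d):
    """Eq. (111), returned in eV/Angstrom^3."""
    w2 = v2 * v2 + v3 * v3
    alpha_c = abs(v2) / math.sqrt(w2)              # covalency, (1 - alpha_p^2)^(1/2)
    bracket = (alpha_c ** 3
               - 9.0 / 8.0 * alpha_c ** 3 * (5.0 * alpha_c ** 2 - 4.0)
               * (v1c ** 2 + v1a ** 2) / w2)
    return -2.0 * v2 * bracket / (math.sqrt(3.0) * d ** 3)


if __name__ == "__main__":
    # Made-up energies, only to compare the two forms.
    b_cov = bulk_modulus_nonpolar(v2=-4.4, v1=-1.8, d=2.35)
    b_pol = bulk_modulus_polar(v2=-4.4, v3=0.0, v1c=-1.8, v1a=-1.8, d=2.35)
    print(b_cov * EV_PER_A3_TO_GPA, b_pol * EV_PER_A3_TO_GPA)
```

Both calls print the same number, which confirms that the two expressions agree in the nonpolar limit.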

Shear strains cause a semiconductor to distort away from perfect tetrahedral symmetry. To deal
with the shear elastic coefficients, the BOM has to be modified. A simple approximation is the
rigid hybrid model (Harrison, 1983b; van Schilfgaarde and Sher, 1987) in which the hybrid
orbitals of each atom are assumed to remain in their original tetrahedral directions despite the
lattice distortion. Then the hybrids of two nearest-neighbor atoms making up the bonding and
antibonding states are no longer directed toward each other, as shown in Figure 3. There is a
misalignment angle $\theta$ between each hybrid and the line connecting the two atoms, and the
covalent energy $V_2$ is given by

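$$ V_2(\theta) = \tfrac{1}{4}\left[V_{ss\sigma} - 2\sqrt{3}\cos\theta\, V_{sp\sigma} - 3\cos^2\theta\, V_{pp\sigma} - 3(1 - \cos^2\theta)\, V_{pp\pi}\right] . \qquad (112) $$

The lowest-order change $\delta V_2$, caused by an infinitesimal angular misalignment
$\delta\theta$, is then given by

$$ \delta V_2 = \tfrac{1}{4}(\sqrt{3}\, V_{sp\sigma} + 3V_{pp\sigma} - 3V_{pp\pi})(\delta\theta)^2 . \qquad (113) $$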


Under the strain e described in Eq. (9) for $C_{11} - C_{12}$, there is no bond length change,
and $(\delta\theta)^2 = 2e^2/3$. If one assumes that the metallization coupling is only through
the metallic energies $V_{1c}$ and $V_{1a}$, as has been assumed so far, then the change of the
crystal energy is the change of the band-structure energy due to $\delta V_2$. Then according to
Eqs. (10) and (105),

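$$ C_{11} - C_{12} = \frac{3\sqrt{3}}{4d^3}\,\frac{\partial E_b}{\partial V_2}\,\frac{\delta V_2}{e^2} = \frac{\sqrt{3}}{4d^3}\,\alpha_c (\sqrt{3}\, V_{sp\sigma} + 3V_{pp\sigma} - 3V_{pp\pi})\left[1 + \left(\tfrac{3}{4} - \tfrac{9}{8}\alpha_c^2\right)(V_{1c}^2 + V_{1a}^2)/(V_2^2 + V_3^2)\right] . \qquad (114) $$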


Harrison (1983b) has pointed out, however, that in addition to $V_1$, other interactions such as
those shown in Figure 2 produce important contributions to the shear elastic constant. By arguing
that these other contributions must cancel those associated with the change $\delta V_2$ arising
from the metallization energy in a rigid rotation, Harrison deduced the following expression:

$$ C_{11} - C_{12} = \frac{\sqrt{3}}{4d^3}\,\alpha_c (\sqrt{3}\, V_{sp\sigma} + 3V_{pp\sigma} - 3V_{pp\pi}) . \qquad (115) $$


Under the strain e for $C_{44}$ given in Eq. (11) and with an internal displacement given by
$u = \eta d/\sqrt{3}$ as described in Section 2, the bond misalignment angles for the four bonds
have the same magnitude, with $(\delta\theta)^2 = 2(\eta + e/2)^2/9$. The four bond lengths also
change, with the change for one pair given by $\delta r_1 = \delta r_2 = \delta + \epsilon$ and
by $\delta r_3 = \delta r_4 = -\delta + \epsilon$ for the other pair, where
$\delta = (e - \eta)d/3$ and $\epsilon = (\eta + e/2)^2 d/9$. If again one assumes that the
metallization is only through $V_1$, then the strain energy density can be shown to be given by

$$ U = 9B\delta^2/(2d^2) + \tfrac{3}{2}(C_{11} - C_{12})(\delta\theta)^2 = B(e - \eta)^2/2 + (C_{11} - C_{12})(\eta + e/2)^2/3 . \qquad (116) $$

For a given strain e, U can be minimized with respect to $\eta$, which yields the Kleinman
displacement parameter $\zeta = \eta/e$ with $\zeta$ given by

$$ \zeta = (B - C/3)/(B + 2C/3) , \qquad (117) $$

where $C = C_{11} - C_{12}$. Finally, from $U = C_{44} e^2/2$ the following relationship is
established:

$$ 9/C_{44} = 6/C + 4/B , \qquad (118) $$

or

$$ I_{BOM} = 9BC/[C_{44}(6B + 4C)] = 1 . \qquad (119) $$

If one includes the effect of $V_1$, the energy density will involve an additional term which
couples $\delta r$ and $\delta\theta$. Then the analysis is no longer simple.
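
The step from Eq. (116) to Eqs. (117) through (119) can be verified numerically. In the sketch below (hypothetical function names, dimensionless illustrative inputs) the strain energy is evaluated at the analytic minimum and compared with $C_{44} e^2/2$.

```python
def strain_energy(e, eta, b, c):
    """Eq. (116), second line, with c = C11 - C12."""
    return b * (e - eta) ** 2 / 2.0 + c * (eta + e / 2.0) ** 2 / 3.0


def kleinman_zeta(b, c):
    """Eq. (117): internal displacement parameter zeta = eta / e."""
    return (b - c / 3.0) / (b + 2.0 * c / 3.0)


def c44_bom(b, c):
    """Eq. (118) solved for C44."""
    return 9.0 * b * c / (6.0 * b + 4.0 * c)


def identity_bom(b, c, c44):
    """Eq. (119); equals 1 when the BOM relation holds."""
    return 9.0 * b * c / (c44 * (6.0 * b + 4.0 * c))


if __name__ == "__main__":
    b, c, e = 1.0, 1.0, 1.0            # illustrative values, not material data
    zeta = kleinman_zeta(b, c)
    print(zeta, c44_bom(b, c))
    print(strain_energy(e, zeta * e, b, c), c44_bom(b, c) * e ** 2 / 2.0)
```

For B = C = 1 this gives $\zeta$ = 0.4 and $C_{44}$ = 0.9, and the minimized energy equals 0.45, as required by $U = C_{44} e^2/2$.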

The above explicit formulas for the elastic constants in BOM are not much more complicated than
those of the valence-force-field model. However, they relate macroscopic forces to intrinsic
atomic interactions. It is interesting to note that the simple identity relation of Eq. (119)
holds very well. As can be seen in Table 3, this result is certainly better than the $I_K$ of
Eq. (78), and is very competitive with Martin's identity $I_M$, which requires the inclusion of
his particular treatment of Coulomb forces. Because the Coulomb energy is not included explicitly
in Eq. (118), its contribution to the elastic constants is probably small.

c. Numerical Results and Quantitative Applications

To study the quantitative aspect of the theory, one first needs to establish the TB parameters.
Table 4 lists the term values we will use. The values of the outermost valence levels are taken
to be minus the experimental first ionization energies listed in Kittel's (1986) book, and the
other term values are deduced from calculated extraction or promotion energies using
norm-conserving atomic pseudopotentials (Bachelet et al., 1982). These term values are very
similar to Mann's (1967) Hartree-Fock calculations used by Harrison (1980), which are also given
in Table 4. The major difference between the two sets occurs in heavy elements, where
relativistic s-shifts are important, but were not included in Mann's results.

Table 5 lists the values of the bond lengths, the bond energies $E_{bond}$, the elastic
coefficients B, $C \equiv C_{11} - C_{12}$, and $C_{44}$, and the zone-center TO phonon
frequencies $\omega$ for a selected group of systems to be examined in the remainder of this
section. Experimental values are presented, with the exception of extrapolations for the elastic
constants of AlP and AlAs. Table 6 compares results between the BOM and the full band-structure
(BS) calculation using Harrison's universal TB parameters. Except for the Ge and HgTe values, the
two calculations agree on $E_{bond}$ to within 10%. The calculated $E_{bond}$ values are also in
fair agreement with the experimental values except for diamond. Since diamond has a much smaller
bond length than the rest of the systems, this discrepancy is an indication of a limit to the
scaling rules for both $V_{\alpha\alpha'}$ and u. Although the trends for the bulk moduli from
both calculations are similar, the calculational errors in the BOM can be as large as 50%. Also
note that the calculated values of B for most systems are only about one half of the experimental
values. The tabulated values of C and $C_{44}$ for BOM are based on Eqs. (115) and (118)
respectively. Considering the simplicity of these formulas, the agreement with the band-structure
calculations is remarkable. However, the calculated values for these two shear coefficients are
also consistently smaller than the experimental values.

The above comparisons show that the BOM and the BS calculations predict similar qualitative
trends for the binding energies and the elastic constants. In this regard, the BOM has the
advantage of providing explicit forms that show the dependences on bond lengths and polarities.
However, the merit of the TB theory over the valence-force model is in its ability to incorporate
atomic quantities to mimic quantum mechanical effects. To be useful for specific material science
applications, the theory has to be more quantitative. Successful quantitative application of the
TB theory has been made by Chadi (1978, 1979 and 1984) in his study of semiconductor surfaces.
The comparisons in Table 6 indicate that the BOM should be treated differently from the BS
calculation when considered for quantitative applications. If one wishes to calculate the
properties using the local picture, one should use the BOM. If one wants to solve the TB
Hamiltonian precisely, one needs to adopt a different set of parameters based on the BS
calculation. For this reason, we shall next consider quantitative applications of BOM and BS
calculations separately.

There are many ways to parametrize the TB theory. Chadi (1979, 1984) used a simple form for the
repulsive energy, $u = a + b(d - d_0) + c(d - d_0)^2$, and the same $1/d^2$ scaling for the TB
parameters $V_{\alpha\alpha'}$. To keep the theory as close to Harrison's (1983a and 1983b) form
as possible but free it from the $1/d^2$ and $1/d^4$ scaling rules for $V_{\alpha\alpha'}$ and u
respectively, we assume the following forms,

$$ V_{\alpha\alpha'} = V^0_{\alpha\alpha'} (d_0/d)^n \qquad (120) $$

and

$$ u = u_0 (d_0/d)^m , \qquad (121) $$

where the superscript and subscript 0 indicate the values evaluated at the equilibrium bond
length $d_0$. For simplicity, the values of $V^0_{\alpha\alpha'}$ are taken to be Harrison's
values given in Eq. (106) scaled by a factor f:

$$ V^0_{\alpha\alpha'} = f\,\eta_{\alpha\alpha'}\,\hbar^2/(m d_0^2) . \qquad (122) $$

Thus there are four parameters for each system: the scaling parameter f, the powers n and m, and
the value $u_0$. These parameters can be determined by requiring that the model produce the
correct experimental values for $E_{bond}$, $d_0$, $C_{11} - C_{12}$, and B. Since
$C_{11} - C_{12}$ is governed only by $V^0_{\alpha\alpha'}$ in both the BOM and the band
calculation, it alone determines the scaling factor f. Then the bond energy $E_{bond}$ can be
used to determine $u_0$. The requirement that the first derivative of $E_T$ is zero at $d_0$ then
determines the ratio of the powers n/m, which couples with the equation for the bulk modulus to
yield the values for n and m. One can then use these sets of parameters to check the validity of
the model by calculating other quantities not employed in the fitting, e.g., $C_{44}$, the
internal displacement parameter $\zeta$ and the optical phonon frequencies $\omega$ at the zone
center. If the results are acceptable, the model can be extended to more complicated systems such
as alloys and superlattices with local environments similar to the bulk crystals.

Table 7 shows the results for f, n, m, and $u_0$ obtained from the above fitting procedure by
using the full band-structure calculations, and the corresponding values of $C_{44}$, $\zeta$,
and $\omega$ calculated for consistency checks. The scaling factor f ranges from 1 to 1.4 and
tends to decrease with an increase in polarity. In the power dependence
$V_{\alpha\alpha'} \propto (d_0/d)^n$, n ranges from 2.8 to 4.3, which is larger than the n = 2
used in Harrison's universal TB parameters. For the repulsive pair energy $u = u_0 (d_0/d)^m$,
the power m ranges from 3.8 to 6.8. The ratio m/n falls in the range from 1.3 to 1.9, which is
smaller than the m/n = 2 used by Harrison. The calculated values of $C_{44}$ for most systems
agree with the experimental data to 10% or better, except for diamond and ZnS. Note that the
experimental data for ZnS are rather dispersed. The calculated TO optical phonon modes at
$\Gamma$ (in cm$^{-1}$) for most group IV and III-V systems also agree with experiments to 10% or
better. The discrepancies for the II-VI systems are larger, about 15%. Reliable results for
$\zeta$ from both experiments and first-principles calculations are available only for a limited
number of systems. The calculated $\zeta$ in the TB model agrees very well with those results, as
shown in Table 1. The overall results for $C_{44}$, $\zeta$ and $\omega$ in the TB calculations
are equivalent to those based on the valence force model including Martin's Coulomb force
corrections. By construction the TB model also produces the correct cohesive energies, bond
lengths, bulk moduli and shear coefficients $C_{11} - C_{12}$, because these quantities are used
to fit the parameters.

The results in Table 7 are based on the term values given in Table 4 and the TB parameters scaled
from Harrison's universal parameters. It is useful to know how the predictions are influenced by
these parameters and the fitting procedure. Table 8 shows the results based on Chadi's procedure,
in which the TB matrix elements $V_{\alpha\alpha'}$ are scaled as $1/d^2$, and the repulsive pair
energy is taken to be $u = u_0 + u_1 (d - d_0) + u_2 (d - d_0)^2$. The parameter $u_0$ is set to
produce the correct bond energy, $u_1$ is determined by requiring the correct equilibrium bond
length, and $u_2$ is fixed by the bulk modulus. Two sets of TB parameters are tabulated for each
system: one is the set used by Chadi (1978, 1979 and 1984), the other is the set obtained by
multiplying Harrison's $V_{\alpha\alpha'}$ by the scaling factor f listed in Table 7. For
convenient comparison, the zero of the term values is set equal to the anion s energy. Despite
considerable differences between these two sets of TB parameters, the predictions from both sets
are very similar to each other and to those obtained with the other procedure used for the
results in Table 7. The only noticeable difference between the predictions in Table 8 and Table 7
is that the present procedure produces larger phonon frequencies and slightly smaller $C_{44}$
values. We also note that the fitted parameters $u_0$, $u_1$, and $u_2$ and the predicted elastic
constants for Chadi's set in Table 8 are not the same as Chadi's published (1979) values; the
published values give bulk moduli about 20% smaller than the experimental values, but
considerably better phonon frequencies.

To parameterize the BOM, several different stages of approximation can be made. However, for a
general application, the full BOM steps presented in Eqs. (100) to (104) for calculating $E_b$
should be followed, regardless of the approximation. The simplest model, referred to as BOM(1),
is to include only $V_{1a}$ and $V_{1c}$ in the matrix element $\langle b|H|a'\rangle$ for the calculation
of  the  metallization  energy  in  Eq.  (104).  The  next  approximation,  BOM(2),  is  to  include^  as 
well. Finally one can include all the first-neighbor interatomic TB parameters in
$\langle b|H|a'\rangle$; this will extend the $|a'\rangle$ to those belonging to the
second-neighbor bonds. This last approximation will be referred to as BOM(3).

Table 9 shows the results for BOM(1) following the parameterization procedure described in
Eqs. (120) to (122). The fitted parameters f, n, m, and $u_0$ are substantially different from
those based on the BS calculations. The predicted $C_{44}$ values for the group IV and III-V
systems are slightly larger than the experimental values, but good for the II-VI systems. The
calculated $\zeta$ values, although not all smaller than those in Table 7, are smaller on the
average. The predicted phonon frequencies are too high.

The parameters and the predicted results from BOM(2) are listed in Table 10. These parameters
more closely resemble those in Table 8 than do the values from BOM(1). However, the predicted
$C_{44}$ values are still too small, and the $\zeta$ values are too large, but the $\omega$
values are better than those from BOM(1).

Table 11 contains the results from BOM(3); these approach those of the BS calculations. In
comparison with Table 7 and the experimental values in Table 5, the BOM(3) does well for
$\omega$, produces slightly smaller $C_{44}$, and probably slightly larger $\zeta$ values.

In conclusion, the TB method is a reasonable approach to the static elastic properties of
semiconductors. If carried out rigorously, the TB parameters in Table 7 will provide quantitative
results for superlattices, alloys, and possibly surfaces in which the local environments are
similar to those in the bulk. The quantitative predictions of BOM are not as good as the BS
calculations, but are still reasonable. The fitted parameters given in Tables 9 to 11 allow
different stages of approximation to be made using the BOM. This is especially useful for more
complicated systems, because computationally the BOM is about two orders of magnitude faster than
the band-structure calculations.

11.  SEMICONDUCTOR  ALLOYS 

The systems to be considered in this section are alloys of the diamond and zincblende
semiconductors, both ordered and disordered. The ordered alloys include binary compounds such as
SiC, and ternary compounds, such as GaInAs$_2$, in three crystal structures of the types CuAuI,
chalcopyrite, and CuPt as shown in Figure 4 (Bernard et al., 1988). The disordered alloys include
binary solutions such as Si$_{1-x}$Ge$_x$ and pseudobinaries such as Hg$_{1-x}$Cd$_x$Te and
GaAs$_{1-x}$Sb$_x$, where x is the fractional concentration. These alloys have been widely used
and studied; however, detailed information about their elastic constants is scarce both
experimentally and theoretically. One reason for the lack of rigorous calculation is that the
elastic constants of these systems are more complex; existing theories are not as accurate,
particularly for disordered alloys. Another reason may be that most properties of these alloys,
including their elasticity, were thought to be reasonably well approximated by the
concentration-weighted averages of their constituents. Because of the rudimentary state of the
theory, we shall deal only with the simplest elastic constant, the bulk modulus. Our focus is on
the difference between the alloy bulk modulus B and the concentration-weighted average value
$\bar{B}$, i.e., $\Delta B = B - \bar{B}$. There are several fundamental questions that can be
addressed. Is $\Delta B$ positive or negative? Do the sign and the magnitude of $\Delta B$ depend
on the state of order? How can we calculate the bulk modulus in a disordered system? Analysis of
these questions constitutes the content of this section.

a.  Ordered  Alloys 

Long-range ordering has been found to exist in many epitaxially grown III-V semiconductor
alloys (Kuan et al., 1985; Jen et al., 1986; Ihm et al., 1987; Klem et al., 1987; Huang et
al., 1987; Gomyo et al., 1986, 1987, and 1988; Norman et al., 1987; Shahid et al., 1987). Wei
and Zunger (1989) have given a rather complete list of these ordered alloys, most of which are of
the form ABC2 existing in three different types of structures, CuAuI, chalcopyrite, and CuPt; all
have their atomic planes stacked as ACBCACBC, but along three different directions, (100),
(210), and (111) respectively, as shown in Figure 4. Several first-principles calculations have
been made to study the structural properties of these compounds; these focused mainly on
the cohesive energies and bond lengths. As indicated in the preceding section, one virtue of
fitting the TB model to the bulk semiconductors is that it can be used for interpolating alloy
properties.

Table 12 lists the results for the bulk moduli of a number of III-V and II-VI alloys derived from
full TB band structure calculations using the parameters given in Table 7. Also listed are the
average values and percentage deviations from the mean, $\Delta B/\bar{B}$. Note that all $\Delta B$
values are negative and that most of the magnitudes are small, except for Ga2AsSb and Ga2PSb; the
latter has the largest difference in the constituent compounds among the alloys listed. Although the
magnitudes of $\Delta B$ get larger for systems with larger differences in the bond lengths, the
dependence does not seem to be a simple function of the bond length difference. The uniformly
negative $\Delta B$ values also appeared in the first-principles local density functional calculations
for ordered GaAsSb alloys by Ferreira et al. (1989).

The reason for the negative values of $\Delta B$, in a very qualitative argument, is that the bulk
moduli of semiconductors scale inversely as high powers of the lattice constant, and, at the same
time, the alloy lattice constant is approximated well by the mean value, by Vegard's law (1921).
This implies that the value of $B$ at the mean lattice constant should lie below the straight-line
average. Since the TB results for the bulk moduli should not be qualitatively different
from the valence force field model (VFF) predictions, most of the key physics for the bowing of
$B$ should be contained in a VFF analysis. The major effects in the VFF can in turn be realized
from the following simple analysis.

Consider the local structure of a CuAuI or chalcopyrite crystal ABC2. Focus on a local
tetrahedral cluster A2B2C with the two A atoms and two B atoms on the vertices of the
tetrahedron and the C atom near the center. Let the coordinates of the two A atoms be
$(-1, 1, -1)\,d/\sqrt{3}$ and $(-1, -1, 1)\,d/\sqrt{3}$, and let the two B atoms be at
$(1, 1, 1)\,d/\sqrt{3}$ and $(1, -1, -1)\,d/\sqrt{3}$. Let the force constants be $k_A$ and $k_B$,
and the equilibrium bond lengths be $d_A$ and $d_B$ for the AC and BC bonds respectively. To attain
equilibrium, the central C atom is displaced by $(\varepsilon, 0, 0)\,d/\sqrt{3}$. We further
define the mean values $\bar{d} = (d_A + d_B)/2$ and $\bar{k} = (k_A + k_B)/2$, the relative
differences $\delta_0 = (d_A - d_B)/\bar{d}$ and $\Delta_0 = (k_A - k_B)/\bar{k}$, and
$d = \bar{d}(1 + \delta)$. Then, to first order, the AC bond is stretched by an amount
$\bar{d}(\delta + \varepsilon/3 - \delta_0/2)$ from its equilibrium value, and similarly the BC bond
by $\bar{d}(\delta - \varepsilon/3 + \delta_0/2)$. The strain energy for arbitrary $\delta$ and
$\varepsilon$ is given by
$\Delta E = \bar{d}^2 [k_A(\delta + \varepsilon/3 - \delta_0/2)^2 + k_B(\delta - \varepsilon/3 + \delta_0/2)^2]$.
When $\Delta E$ is minimized with regard to $\delta$ and $\varepsilon$, one finds $\delta = 0$ and
$\varepsilon = 3\delta_0/2$, and the minimum $\Delta E$ is zero. If the crystal expands uniformly
with $\delta$ having a fixed small value, then minimizing with respect to $\varepsilon$ alone gives
$\varepsilon = 3\delta_0/2 - 3\Delta_0\delta/2$ and
$\Delta E = 2\bar{k}(1 - \Delta_0^2/4)\,\bar{d}^2\delta^2$. Thus the effective spring constant is

$k_{\mathrm{eff}} = \bar{k}\,(1 - \Delta_0^2/4)$ ,  (123)

which is smaller than the average value $\bar{k}$. This weakening of the restoring force constant
in the alloy is due to the internal displacement, represented by $\varepsilon$ in the above model,
which provides an extra degree of freedom for relaxation in response to the external stress. The
bulk modulus $B$ is proportional to $k_{\mathrm{eff}}/d$, so the alloy bulk modulus minus the mean
$\bar{B}$ is then given by

$\Delta B \approx \bar{B}\,(\delta_0\Delta_0 - \delta_0^2 - \Delta_0^2)/4$ ,  (124)

where we recall the definitions $\delta_0 = (d_A - d_B)/\bar{d}$ and $\Delta_0 = (k_A - k_B)/\bar{k}$.
Since $\delta_0$ and $\Delta_0$ tend to have different signs, the bond length difference gives an
extra negative contribution to $\Delta B$ (the first two terms). If both the bond-stretching force
constant $\alpha$ and the bond-angle restoring force constant $\beta$ in the VFF are included, the
equilibrium value of $\Delta E$ is no longer zero; but the deviation $\Delta B$
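can also be shown to be similar to Eq. (124) and is given by

$\Delta B = \bar{B} \left[ \delta_0 \left( \frac{3\Delta\alpha + \Delta\beta}{3\bar{\alpha} + \bar{\beta}} \right) - \delta_0^2 - \frac{3(\Delta\alpha)^2}{(\bar{\alpha} + 2\bar{\beta})(3\bar{\alpha} + \bar{\beta})} \right] / 4$ ,  (125)

where $\Delta\alpha = \alpha_A - \alpha_B$ and $\bar{\alpha} = (\alpha_A + \alpha_B)/2$, and similarly
for $\Delta\beta$ and $\bar{\beta}$. Eq. (125) reduces to Eq. (124) if $\beta$ is set equal to zero.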
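
The two-spring cluster argument leading to Eqs. (123) and (124) can be checked numerically. The following is a minimal sketch, assuming arbitrary units with the mean bond length set to 1 and illustrative values for the force constants and bond-length mismatch; the function names are hypothetical and not part of the original analysis. It relaxes the internal displacement at fixed dilation, reads off the effective spring constant, and compares the resulting bowing of B (taken as proportional to k/d) with the second-order expression of Eq. (124).

```python
def strain_energy(delta, eps, k_a, k_b, delta0, d_mean=1.0):
    """Strain energy of the A2B2C cluster for dilation delta and C-atom displacement eps."""
    stretch_ac = delta + eps / 3.0 - delta0 / 2.0
    stretch_bc = delta - eps / 3.0 + delta0 / 2.0
    return d_mean**2 * (k_a * stretch_ac**2 + k_b * stretch_bc**2)


def relaxed_energy(delta, k_a, k_b, delta0):
    """Minimize the (quadratic) strain energy over eps at fixed delta."""
    # Stationarity in eps: k_a*(delta + u) = k_b*(delta - u), with u = eps/3 - delta0/2.
    u = -delta * (k_a - k_b) / (k_a + k_b)
    eps = 3.0 * (u + delta0 / 2.0)
    return eps, strain_energy(delta, eps, k_a, k_b, delta0)


def effective_spring_constant(k_a, k_b, delta0, delta=1e-3):
    """Read k_eff off the relaxed energy, using E = 2 * k_eff * d_mean**2 * delta**2."""
    _, energy = relaxed_energy(delta, k_a, k_b, delta0)
    return energy / (2.0 * delta**2)


def relative_bowing_exact(k_a, k_b, delta0):
    """(B_alloy - B_mean) / B_mean with B taken proportional to k/d and d_mean = 1."""
    d_a, d_b = 1.0 + delta0 / 2.0, 1.0 - delta0 / 2.0
    b_alloy = effective_spring_constant(k_a, k_b, delta0)
    b_mean = 0.5 * (k_a / d_a + k_b / d_b)
    return (b_alloy - b_mean) / b_mean


def relative_bowing_eq124(k_a, k_b, delta0):
    """Second-order estimate of Eq. (124)."""
    big_delta0 = (k_a - k_b) / (0.5 * (k_a + k_b))
    return (delta0 * big_delta0 - delta0**2 - big_delta0**2) / 4.0


# Illustrative (assumed) inputs: the stiffer bond is the shorter one.
k_a, k_b, delta0 = 1.2, 0.8, -0.05
print("k_eff              :", effective_spring_constant(k_a, k_b, delta0))
print("bowing, cluster    :", relative_bowing_exact(k_a, k_b, delta0))
print("bowing, Eq. (124)  :", relative_bowing_eq124(k_a, k_b, delta0))
```

The two bowing values differ only at third order in the small quantities, which is the accuracy to which Eq. (124) is stated.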

The above descriptions illustrate two mechanisms for the negative $\Delta B$ values: the $1/d^q$
scaling of $B$, with $q$ ranging from 3.5 to 9, and more degrees of freedom for internal relaxation.
Quantitative results should be described by TB, because in addition to the strain energy, there is
also some chemical effect built into the TB theory. Although these ordered compounds have
been found from epitaxial growth, the bulk moduli are probably difficult to measure, because
these alloys are not single bulk crystals and because the ordering is only partial. It is interesting
to note that the $B$ values for a SiC/AlN alternating layer superlattice along (100) and for the
constituent compounds have been calculated by Lambrecht and Segall (1990) using the LMTO method;
their percentage deviation from the mean, $\Delta B/\bar{B}$, was found to be about -2%, which falls
in the range of the ternary alloys in Tables 12 and 13.
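
The first mechanism, power-law scaling of B combined with Vegard's law, can be illustrated with a short calculation. This is a sketch only: it assumes a pure power law B proportional to a^(-q) with the same prefactor for both constituents (so the prefactor cancels), and it uses the commonly quoted room-temperature lattice constants of GaAs and GaSb purely as example inputs. The function names are hypothetical.

```python
def power_law_modulus(a, q):
    """Bulk modulus in arbitrary units, assuming B is proportional to a**(-q)."""
    return a ** (-q)


def scaling_bowing(a_1, a_2, q, x=0.5):
    """Relative deviation (B - B_mean) / B_mean when the alloy sits at the Vegard lattice constant."""
    a_alloy = (1.0 - x) * a_1 + x * a_2  # Vegard's law
    b_alloy = power_law_modulus(a_alloy, q)
    b_mean = (1.0 - x) * power_law_modulus(a_1, q) + x * power_law_modulus(a_2, q)
    return (b_alloy - b_mean) / b_mean


# Example inputs (assumed for illustration): GaAs and GaSb lattice constants in angstroms.
a_gaas, a_gasb = 5.653, 6.096
for q in (3.5, 6.0, 9.0):
    print(f"q = {q}: relative bowing = {scaling_bowing(a_gaas, a_gasb, q):.4f}")
```

Because a^(-q) is a convex function of a for any positive q, the value at the mean lattice constant always lies below the straight-line average, and the deviation grows with q.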

Not all the mechanisms considered above apply to the ordered compounds of the elemental
semiconductors, because internal relaxation under pressure may not be allowed, e.g., if the
structure is assumed to be zincblende. Unfortunately, a simple analysis of the elastic constants of
the 4-4 compounds cannot yet be made, because the tight-binding and VFF parameters have not
yet been extended to deal with the atomic pairs not existing in the constituent crystals. However,
several first-principles calculations have been made on the ordered SiC and SiGe (Martins and
Zunger, 1986; Qteish and Resta, 1988; van Schilfgaarde, 1990). The main results are listed in
Table 13; the theoretical results were calculated for the zincblende structure. The plane-wave
pseudopotential (PP-PW) calculation of Martins and Zunger (1986) for SiC gave a -21% value
for $\Delta B/\bar{B}$, which is in reasonable agreement with the experimental value of -17%. This
is consistent with the qualitative argument based on the $1/d^q$ scaling of $B$. The theory also
yielded a negative formation energy, which also agrees with experiment. For SiGe, the theoretical
calculations cited in Table 14 gave positive formation energies, which are consistent with the fact
that no ordered bulk compounds of SiGe have been grown. However, some weak ordering has been
found in the epitaxial SiGe films (Ourmazd and Bean, 1985). The calculated values of
$\Delta B/\bar{B}$ for the zincblende SiGe are either slightly above or just below zero. These
differences, however, fall within the uncertainties of the present first-principles theory. The best
conclusion which can be drawn from these results is that the $B$ value for SiGe should be very close
to the mean value.

b. Disordered Alloys

Disordered binary alloys A1-xBx of diamond semiconductors and pseudobinary alloys
A1-xBxC of zincblende semiconductors AC and BC are considered in this section; they are not
amorphous materials, as they still possess their constituent diamond and zincblende lattices
respectively, as characterized by their crystal diffraction patterns. We shall consider the
pseudobinaries first. The alloying atoms A and B in these alloys belong to one fcc sublattice, and
the C atoms to the other sublattice. However, the positions of the A and B atoms are not necessarily
locked precisely on the lattice sites. The extended x-ray absorption fine-structure spectroscopy
(EXAFS) data (Mikkelsen and Boyce, 1982 and 1983; Boyce and Mikkelsen, 1985; Balzarotti et
al., 1984 and 1985) have consistently shown a bimodal distribution of the bond lengths in these
alloys, although the average lattice constant follows the Vegard (1921) law
$a = (1 - x)\,a_{AC} + x\,a_{BC}$. Figure 5 shows an example of the results for the bond lengths in
Ga1-xInxAs deduced by Mikkelsen and Boyce (1982) from their EXAFS data. In a first approximation,
the crystal structure of an alloy can be viewed as having the A and B atoms on their fcc sublattice
with an average lattice constant, while the C atoms are displaced away from their lattice sites, in
a way similar to the local structures of the three ordered superlattices considered earlier. The
difference is that there is no long-range superlattice ordering of the A and B atoms in the
disordered state. This simple crystal picture is only a first approximation; the above EXAFS
experiments also indicate that the sublattice of the A and B atoms is less than a perfect fcc
lattice. There are also theoretical calculations (e.g., Sher et al., 1987; Ferreira et al., 1989)
which suggest a certain degree of short-range ordering in these alloys, namely that the arrangement
of the A and B atoms is not random. Figure 6 shows an example of the calculated deviations (Sher et
al., 1987) of the probabilities from the random distribution, $\Delta p_n = p_n - p_n^0$, for
Ga1-xInxAs as a function of alloy concentration x, where $p_n$ is the probability of having n Ga
atoms and 4-n In atoms on the vertices of a local tetrahedral cluster in the alloy, and
$p_n^0 = \binom{4}{n}(1 - x)^n x^{4-n}$, where $\binom{4}{n}$ is a binomial coefficient, is the
corresponding value for the random distribution.
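
The random reference distribution used above is simply a binomial distribution over the four vertices of the tetrahedron. A minimal sketch follows; the function names are hypothetical, and the non-random probabilities passed to `deviation_from_random` are placeholder inputs that would come from a statistical model, not values taken from Figure 6.

```python
from math import comb


def random_cluster_probabilities(x):
    """Return [p_0^0, ..., p_4^0] for a random Ga(1-x)In(x)As alloy.

    p_n^0 is the probability that a tetrahedral cluster carries n Ga atoms
    and 4 - n In atoms when each vertex is Ga with probability 1 - x.
    """
    return [comb(4, n) * (1.0 - x) ** n * x ** (4 - n) for n in range(5)]


def deviation_from_random(p, x):
    """Return Delta p_n = p_n - p_n^0 for a given (assumed) cluster distribution p."""
    p0 = random_cluster_probabilities(x)
    return [p_n - p0_n for p_n, p0_n in zip(p, p0)]


print(random_cluster_probabilities(0.5))
```

Any candidate distribution must still sum to one and reproduce the overall composition, so the deviations $\Delta p_n$ sum to zero.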

The structural energy needed for calculating the elastic constants of a disordered alloy is
an ensemble average of the total energy over the distribution of the alloying atoms under strain.
It has been demonstrated (Ferreira et al., 1989) that the total energy of a semiconductor can be
decomposed into the sum of multisite correlation energies, from the single-site, the pair, and up
to a cluster containing a handful of sites. In other words, the multisite correlation energies
converge to zero quickly at a manageable number of sites. This implies that the structural energy of
an alloy is an average of these multisite correlation energies. Connolly and Williams (1983),
working on metal alloys, proposed that these multisite correlation energies be deduced from the
ordered systems that are composed of the same atoms. This scheme allows a direct application
of the first-principles theory in the calculation of the energy parameters. These energetics can
then be used in the alloy statistics, such as in the cluster variational method of Kikuchi (1951) or
in Monte Carlo calculations, to deduce the distribution functions or the average properties.
This theory has been carried out extensively for semiconductor alloys by Ferreira et al. (1989),
and respectable results have been obtained for the phase diagrams and alloy equilibrium properties.
For this theory to fit the elastic constants requires detailed dependences of the multisite
energies under different strains that have yet to be worked out. Also, the validity of using the
energy parameters deduced from ordered alloys in the disordered systems needs to be examined
further.

There is a different cluster approach, which directly relates the alloy Hamiltonian to the
distribution function (Gautier et al., 1975; Chen et al., 1987; Berra et al., 1988; Dreysse et al.,
1989). In this approach one focuses on a particular cluster in an alloy ensemble. The average
energy per cluster can be written (Chen et al., 1988) as

$\langle\varepsilon\rangle = \sum_n \varepsilon_n p_n + \tfrac{1}{2} \sum_{n,m} h_{nm}\, p_n P_{mn}$ ,  (126)

where $\varepsilon_n$ is the energy of a cluster detached from a given alloy configuration, and
$h_{nm}$ is the interaction energy, which is the change of energy of the combined system when the
cluster is put back into the alloy. In Eq. (126), $p_n$ is the probability that a cluster is of the
type n, specified by the number of A and B atoms and their arrangements, and $P_{mn}$ is a
conditional probability that the surrounding environment is in state m when the cluster is in state
n. The factor 1/2 in Eq. (126) is to eliminate the double counting in the total average alloy energy
$\langle E\rangle = M\langle\varepsilon\rangle$,
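where M is the ratio between the size of the alloy and the cluster. If one writes
$\langle\varepsilon\rangle = \sum_n p_n\,\varepsilon(n)$, then an effective cluster energy can be
defined as

$\varepsilon(n) = \varepsilon_n + \tfrac{1}{2} \sum_m h_{nm} P_{mn}$ .  (127)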

This procedure is particularly useful when the interaction energy is short ranged. One
can start with a small cluster and a given probability distribution, then calculate the effective
cluster energies from Eq. (126). These energies are then used in a statistical theory to deduce a
cluster distribution, which in turn is used to calculate a new set of $\varepsilon(n)$ and
distribution functions, and the process is iterated until it converges. It should be pointed out
that to compute the total energy of an alloy quantum mechanically, one needs to solve the
Schroedinger equation for a Hamiltonian which does not have the lattice translational symmetry so
indispensable in traditional band-structure theory. If the fluctuation of the alloy potential from
the virtual crystal approximation (VCA), where the alloy potential is approximated as the
concentration-weighted average, is small, then the next leading correction to the VCA can be
obtained from perturbation theory. This should work for most semiconductor alloys except for systems
with large potential fluctuations such as Hg1-xCdxTe (Chen and Sher, 1982; Spicer et al., 1982;
Hass et al., 1983). A more general but more difficult approach is to extend the present molecular
coherent potential approximation (MCPA) (Hass et al., 1984) to clusters and to achieve a triple
self-consistency (Chen et al., 1987): consistency between the cluster distribution and the
Hamiltonian, between the Hamiltonian and the electron density, and between the self-energy operator
$\Sigma$ in the cluster CPA theory and the potential fluctuations. To date this theory has been
carried out only for metal alloys, and then only within the single-site KKR-CPA with a random
distribution (Schwartz and Bansil, 1975; Gyorffy and Stocks, 1978). Major work is needed to
determine if this approach can achieve the same degree of rigor for disordered alloys as
self-consistent density functional theory, which has been successfully used in dealing with
crystalline semiconductors.

The above idea has been applied to an elastic medium model to deduce a mean field
theory for the internal strain and bulk modulus in semiconductor alloys (Chen et al., 1988). This
theory starts by assuming that the alloy has an effective lattice constant and effective modulus.
When part of the effective alloy medium is replaced by a specified cluster, there will be strain
energy introduced. It was shown that this strain energy can be taken as the effective energy
$\varepsilon(n)$ for that cluster; the probability distribution $p_n$ can then be deduced within a
statistical theory. The internal strain energy is calculated as
$E = M\langle\varepsilon\rangle = M \sum_n p_n\,\varepsilon(n)$. When the alloy is under an
external pressure $\delta P$, the effective cluster energy will change by an amount
$\delta\varepsilon(n)$, which implies a change of the total energy by an amount
$\delta E = M\langle\delta\varepsilon(n)\rangle$. Then the bulk modulus of the alloy
can be obtained from $\delta E = \tfrac{1}{2}(\delta P)^2 V/B$, where $V$ is the alloy volume. The
mean-field nature of this approach is evident from the fact that the calculation requires knowledge
of the alloy lattice constant and elastic constants, which are only assumed and are required to be
calculated self-consistently. To illustrate this self-consistency procedure, let us consider the
following simple spring model for a random pseudobinary alloy A1-xBxC. The cluster corresponds to
the four bonds surrounding an "impurity" atom A or B, and the environment of the cluster corresponds
to the twelve bonds which connect inwardly to the cluster and outwardly to a rigid lattice of the
effective alloy. It is worth mentioning that there have been detailed analyses of the valence force
field models for the strain energies of semiconductor alloys in regard to the range allowed for
lattice relaxation (Martins and Zunger, 1984; Chen and Sher, 1985). It was found that by
neglecting the bond-angle forces, one can use a shorter range of lattice relaxation to obtain the
correct mixing enthalpies and bond lengths. The simple model considered here works amazingly
well for these properties.

Let the spring constants for the pure AC and BC compounds be $k_A$ and $k_B$ respectively,
and the effective alloy spring constant be $k$, with similar notations for the bond lengths $d_A$,
$d_B$, and $d$. When an A atom is embedded in the medium, all sixteen bonds under consideration will
relax, and the strain energy is given by

$\varepsilon(A) = \tfrac{1}{2} k_1 (d - d_A)^2$ ,  (128)

where

$k_1 = 4 k_A k / (3 k_A + k)$ .  (129)

A similar energy $\varepsilon(B)$, with $k_2$ defined analogously in terms of $k_B$, is obtained
when a B atom is embedded. The effective bond length $d$ is obtained from a minimization of the
average cluster energy $\bar{\varepsilon} = (1 - x)\,\varepsilon(A) + x\,\varepsilon(B)$ with
respect to $d$, which yields

$d = [(1 - x) k_1 d_A + x k_2 d_B] / [(1 - x) k_1 + x k_2]$ .  (130)

When the alloy is compressed, the alloy bond length is reduced to $d(1 - e)$, where $e$ is a
macroscopic strain corresponding to the external pressure. The pressure-induced strain energy for
the sixteen bonds in the medium is $\delta E = 8k(de)^2$, and $\delta\varepsilon = 2k(de)^2$ for
each cluster. Embedding an A atom in this compressed medium, one finds the total strain energy for
the sixteen bonds to be

$E_A = \tfrac{1}{2} k_1 (d - d_A - 4de)^2$ .  (131)

To obtain the extra cluster energy $\delta\varepsilon(A)$ induced by the pressure, we subtract
$\varepsilon(A)$ of Eq. (128) and the background energy for the surrounding twelve bonds from $E_A$
to give

$\delta\varepsilon(A) = -4k_1 (d - d_A)\,de + 8k_1 (de)^2 - 6k (de)^2$ .  (132)

Similarly the following expression for $\delta\varepsilon(B)$ is obtained when the embedded atom is B:

$\delta\varepsilon(B) = -4k_2 (d - d_B)\,de + 8k_2 (de)^2 - 6k (de)^2$ .  (133)

Thus the change of the average cluster energy that is due to the pressure is given by
$\langle\delta\varepsilon(n)\rangle = (1 - x)\,\delta\varepsilon(A) + x\,\delta\varepsilon(B)$, in
which the terms linear in $e$ cancel by Eq. (130). When this is equated to $2k(de)^2$, it leads to
the following self-consistent equation for the effective spring constant $k$:
$k = (1 - x) k_1 + x k_2$. The $k$ can now be solved analytically when both the expression for $k_1$
in Eq. (129) and the similar expression for $k_2$ are used. The result is

$k = \langle k\rangle\,[1 - 3x(1 - x)(\delta k/\langle k\rangle)^2]$ ,  (134)

where $\langle k\rangle = (1 - x) k_A + x k_B$ is the mean spring constant and
$\delta k = k_A - k_B$ the difference. It is interesting to compare this result for the 50/50 alloy,
i.e., $k \approx \bar{k}(1 - \tfrac{3}{4}\Delta_0^2)$, with the value $\bar{k}(1 - \Delta_0^2/4)$ in
Eq. (123) for the ordered alloys in the CuAuI and chalcopyrite structures. The
alloy spring constant is slightly below the straight line average, and the bowing is larger for a
disordered alloy than for the corresponding ordered compound. Using the effective spring constant
of Eq. (134), we find that the effective bond lengths for most alloys also bow slightly below
their mean value,

$d = \langle d\rangle + 4x(1 - x)(d_A - d_B)(k_A - k_B)\,k / [(3k_A + k)(3k_B + k)]$ ,  (135)

because the spring constant tends to increase as the bond length decreases.
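
The self-consistency condition $k = (1 - x)k_1 + xk_2$, with $k_1$ and $k_2$ from Eq. (129), can also be solved by direct iteration, which mirrors the mean-field procedure described above. The sketch below is an illustration under stated assumptions: the spring constants and bond lengths are arbitrary example numbers, the function names are hypothetical, and simple fixed-point iteration is used here for transparency rather than because the original work solved the equation this way. It then evaluates the effective bond length from Eq. (130) and compares it with the closed form of Eq. (135).

```python
def embedded_spring_constant(k_pure, k_eff):
    """Eq. (129): combined constant for an impurity with spring k_pure in a medium with k_eff."""
    return 4.0 * k_pure * k_eff / (3.0 * k_pure + k_eff)


def solve_effective_spring_constant(x, k_a, k_b, tol=1e-12, max_iter=1000):
    """Iterate k -> (1 - x) * k1(k) + x * k2(k) until the update is smaller than tol."""
    k = (1.0 - x) * k_a + x * k_b  # start from the concentration-weighted mean
    for _ in range(max_iter):
        k_new = ((1.0 - x) * embedded_spring_constant(k_a, k)
                 + x * embedded_spring_constant(k_b, k))
        if abs(k_new - k) < tol:
            return k_new
        k = k_new
    raise RuntimeError("self-consistency iteration did not converge")


def bond_length_eq130(x, k_a, k_b, d_a, d_b, k):
    """Eq. (130): effective bond length from minimizing the average cluster energy."""
    k1 = embedded_spring_constant(k_a, k)
    k2 = embedded_spring_constant(k_b, k)
    return ((1.0 - x) * k1 * d_a + x * k2 * d_b) / ((1.0 - x) * k1 + x * k2)


def bond_length_eq135(x, k_a, k_b, d_a, d_b, k):
    """Eq. (135): the same bond length written as the mean plus a bowing term."""
    d_mean = (1.0 - x) * d_a + x * d_b
    bowing = (4.0 * x * (1.0 - x) * (d_a - d_b) * (k_a - k_b) * k
              / ((3.0 * k_a + k) * (3.0 * k_b + k)))
    return d_mean + bowing


# Example inputs (assumed): the stiffer AC bond is the shorter one.
x, k_a, k_b, d_a, d_b = 0.5, 1.2, 0.8, 0.98, 1.02
k = solve_effective_spring_constant(x, k_a, k_b)
print("self-consistent k :", k, " mean k:", (1.0 - x) * k_a + x * k_b)
print("d from Eq. (130)  :", bond_length_eq130(x, k_a, k_b, d_a, d_b, k))
print("d from Eq. (135)  :", bond_length_eq135(x, k_a, k_b, d_a, d_b, k))
```

At the self-consistent $k$ the two bond-length expressions coincide, and in the pure limits x = 0 and x = 1 the iteration returns $k_A$ and $k_B$ respectively.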

To compare the calculations above with experimental data for pseudobinary alloys, we
were able to find results for GaAlAs (Landolt-Börnstein, 1988), CdZnTe, CdMnTe, and HgCdTe
(Qadri et al., 1986). For GaAlAs, the following linear x dependences were measured
(Landolt-Börnstein, 1988): C11 = 11.85 + 0.14x, C12 = 5.38 + 0.32x, and C44 = 5.94 - 0.05x. This
lack of detectable bowing is expected, because of the nearly equal bond lengths of the two
constituent compounds and the small differences in the elastic constants. The bulk moduli in the
three II-VI alloy systems mentioned were obtained from high-pressure x-ray diffraction data. For
HgCdTe, the results are similar to those for GaAlAs in that both the bond lengths and the bulk
moduli of HgTe and CdTe are so close that the differences of $B$ between the alloys and the pure
crystals were beyond the experimental resolution. However, 5% Zn in a CdZnTe alloy was found to give
a 15% increase in the $B$ value from the pure CdTe value, and 10% Mn in CdMnTe gave a 21%
decrease (Qadri et al., 1986). These significantly large changes in the $B$ values caused by
small concentrations cannot be explained from the above considerations.

The qualitative model considered above does not apply to the binary alloys A1-xBx,
because in these alloys both the A and B atoms can be found in both sublattices, and the local
bond length arrangement is more complicated than in the pseudobinary alloys. However, one can
expect that there are still more degrees of relaxation in the disordered binaries than in the ordered
compounds. Therefore one would conclude that the bulk modulus of the disordered 50-50 SiGe
alloy would have a smaller value than those tabulated in Table 13 for the ordered compounds. At
least one would not expect the alloy $B$ values to be significantly larger than the mean values
$\bar{B}$. However, the only experimental data available (Bublik et al., 1974), Table 15, show that
all three elastic constants for these alloys at three different concentrations exceed the values for
Si, and that $\Delta B/\bar{B}$ is as large as 20%, despite the fact that the bond length difference
between Si and Ge is only about 3% and the measured alloy lattice constants are only bowed slightly
below the average. This, and the unexplained results for the II-VI alloys, point to the need for a
more systematic study of the elastic properties of semiconductor alloys, both experimentally and
theoretically.

IV. DISLOCATIONS AND HARDNESS


Hardness has proven to be a useful probe of the mechanical properties of the brittle
semiconductors. Here we will use the term hardness to refer specifically to Vickers' hardness, unless
otherwise noted. In the Vickers' hardness measurement, a square pyramidal indenter is used and
the hardness number is given by the applied load divided by the area of the indentation (i.e., units
of pressure). Hardness has been found to be an intrinsic property of the material, because it is
relatively independent of the applied load. One advantage of hardness measurements for
semiconductors is that, unlike bending tests, only small samples are necessary for conventional
Vickers' hardness measurements, or for nanoindenter measurements (Fang et al., 1990), and
relatively thin epitaxial films can be probed. Additionally, unlike conventional tests used to
measure yield stress, hardness measurements can be made at room temperature, which is far below
the usual plastic regime for most semiconductors. As such, the hardness measurement provides a
convenient and usable probe.

The question remains, though, as to the interpretation of the hardness measurement in
semiconductors: just what property or properties of a semiconductor are we measuring when we
measure hardness? In metals, an empirical relationship is found between the hardness, H, and
yield stress, Y, such that H = 3Y.

In metals, this relationship can be justified on the basis of continuum theory, as discussed
in McClintock and Argon (1966). In semiconductors such a simple relationship between H and
Y is not necessarily appropriate for several reasons. During deformation in metals, many slip
planes can be active because the Peierls barriers for dislocation motion in most directions are
low. In contrast, because the bonds in semiconductors are strongly covalent, the Peierls barriers
are high and dislocations in the $\{111\}\,\tfrac{1}{2}\langle\bar{1}10\rangle$ slip system dominate.

To date, there is no complete quantitative theory of hardness in the semiconductors in
which the temperature dependence, photoplastic effect, and the alloy hardening effect are
included. Sher et al. (1985) proposed a model of hardness for the semiconductor compounds that
gives good quantitative agreement with experiment, but this model does not provide an explanation
for several of the observed dependences of hardness. This model of hardness in semiconductors
differs from more conventional interpretations and suggests that hardness is dominated
by dislocation-dislocation interactions, as opposed to dislocation activation and motion terms.
We discuss the results of an improved quantitative model of hardness below.


12. SLIP SYSTEMS

In the dislocation interaction hardness model, Vickers' hardness is found to be dominated
by the interaction energy of an idealized array of dislocations that has been generated by the
indenter. The idealized array can be considered as a first approximation to the more realistic
dislocation tangles found experimentally, leading in higher order to an expansion in dislocation
configurations. In this idealized array, no account is taken of the true slip systems active in the
semiconductors. Experiments (Hirsch et al., 1985) have demonstrated that, in Vickers' hardness,
slip occurs primarily on the $\{111\}\,\tfrac{1}{2}\langle\bar{1}10\rangle$ glide set, where the
threefold symmetry of slip and rosette lines occurs at the intersection of the {111} planes with the
(111) surface.

For indentation on the (111) plane, dislocations can glide on the (111) plane parallel to
the surface or on one of the three other {111} planes, for a total of four active slip planes.
Although the detailed analysis differs from that given previously (Sher et al., 1985), the
contribution to the hardness from the interaction energy is comparable to that previously
calculated. This contribution to H is directly proportional to the shear coefficient.

13. PEIERLS ENERGY

The Peierls energy is difficult to calculate precisely because of dislocation charge effects
and reconstruction at the dislocation core. In the context of the hardness measurement we
calculate the Peierls energy in order to evaluate the importance of this contribution to the Vickers'
hardness number. Although it is generally agreed that dislocations in semiconductors move
through the generation and propagation of double kinks, in the hardness measurement the region
about the indenter is grossly plastically deformed. Because the dislocation velocity is low at
room temperature (see below), the large dislocation pile-up model proposed by Sher et al. (1985)
may be appropriate. If the dislocation separation is small, dislocation motion through kink
processes will be suppressed, and the dislocations will propagate as a complete unit.

To get from Configuration A to Configuration B in Figure 7, we must break a row of
bonds. Since the long-range strain fields should be comparable in the two configurations as well
as in intermediate configurations, the Peierls force can be calculated from local energy considerations
only. The energy to break a bond at the dislocation core is approximately given by:

Ub  =  2Vvl  +  v|+2e„«-Vo  +  3-(Ej;-ES)  ,  (136) 

where V2 is the covalent energy, V3 the ionic energy, Emet the metallization energy, V0 the bond
overlap energy, and ε_h^a and ε_h^c the hybrid energies for the anion and cation, respectively. The first
two terms in Eq. (136) account for the loss of the bonding energy of the two electrons in the
breaking bond, the third term accounts for regaining the repulsive interaction energy of the bond,
and the fourth term accounts for the energy gain to transfer electrons back from the cation to the
anion. We note that the electron orbitals of the atoms at the dislocation core are left in the sp3
hybrids after the bond breaking. The expression in Eq. (136) represents a theoretical maximum
of the Peierls energy, since no reconstruction at the core has been included.
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To  calculate  the  Peierls  energy  per  unit  length,  we  consider  a  primary  dislocation  in  the 
<1̄10> direction in a zincblende compound. The number of bonds per unit length in
<1̄10> is given by 1/b, where b is the magnitude of the Burgers vector. Thus, the Peierls energy per unit length is
given by

Ep = Ub/b ,  (137)


where  d  is  the  bond  length. 
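
As a rough numerical illustration of Eq. (137), the sketch below evaluates the Burgers vector and the Peierls energy per unit length from the bond length. It assumes ideal zincblende geometry (cubic lattice constant a = 4d/sqrt(3) and Burgers vector magnitude b = a/sqrt(2) for the 1/2<110> set), which the text does not spell out, and it uses a placeholder value for Ub because Table 16 is not reproduced here. The function names are illustrative only.

```python
import math


def burgers_vector(d):
    """Magnitude of the 1/2<110> Burgers vector for bond length d (same units as d).

    Assumes ideal zincblende geometry: a = 4d/sqrt(3), b = a/sqrt(2).
    """
    a = 4.0 * d / math.sqrt(3.0)
    return a / math.sqrt(2.0)


def peierls_energy_per_length(u_b, d):
    """Eq. (137): Ep = Ub / b, i.e. one broken bond per length b along <110>."""
    return u_b / burgers_vector(d)


d_gaas = 2.448          # GaAs bond length in Angstrom (Table 5)
u_b_placeholder = 1.0   # hypothetical bond-breaking energy in eV, NOT a Table 16 value

b = burgers_vector(d_gaas)
e_p = peierls_energy_per_length(u_b_placeholder, d_gaas)
print(f"b  = {b:.3f} Angstrom")
print(f"Ep = {e_p:.3f} eV/Angstrom for Ub = {u_b_placeholder} eV")
```

Because Ep scales linearly with Ub, the placeholder can be replaced by any tabulated bond-breaking energy without changing the rest of the calculation.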

We can now calculate the Peierls force, or the force per unit length necessary to move a
dislocation over the potential barrier, as illustrated in Figure 8. The Peierls energy is related to
the Peierls force through

Ep  =  FpL  .  (138) 


where 

L  =  t  (139) 

is the distance between Configuration A and Configuration B. Solving for Fp in terms of Ub
and d, we arrive at:

Fp  =  J^  ,  (140) 

4 


Values for Ub and Fp are summarized in Table 16.

Now we incorporate the Peierls energy into the hardness model for low temperature,
where the full barrier must be surmounted. The Sher model is based on energy considerations.
The Vickers' hardness number is given by the applied force divided by the area of indentation.
Multiplying the numerator and denominator by h, the depth of indentation, we have H =
E/(W²h), where E = Fh is the energy of indentation.
Including the interaction energy only, we have:


(142) 


where θ is one half the indenter angle. To include the Peierls energy, we consider the total
energy necessary to move the dislocations from their initial to final positions in the idealized
model. The hardness is then given by:

H = Hint + Hp ,  (143)


where 


(144) 


is the Peierls contribution to the hardness, and Ep is the total Peierls energy expended. The total
length of dislocation to be moved is calculated to be:

Lr^ia^cos^e  .  (145) 

6  b2 


The  total  Peierls  energy  is  given  by: 


(146) 


Thus,  we  have: 

Hp  =  ^^cos0  sin©  . 


For θ = 45°:


Hp 


=  iUh. 
6b5  ‘ 


(147) 


(148) 
Values of Hp are summarized in Table 16. Several features of Hp should be noted. First,
we have used a zero-temperature value of the Peierls energy. Because hardness measurements
are typically done at room temperature, one should take the thermal energy into account; this will
reduce the values of Hp from those listed in Table 16. Also shown in Table 16 are Hint, Hint +
Hp (the best theoretical estimate for H), and Hexp. Note that, like Hint, Hp is independent of the
applied load, in agreement with experiment. Also note that Hp improves the agreement between
theory and experiment for the hard, nonpolar materials. For the softer, more ionic materials, H is
overestimated by the theory. The overestimation of H may be due to the neglect of dislocation
velocity effects and their temperature dependence, as discussed below.


14.  TEMPERATURE  DEPENDENCE 

Here we summarize the experimental results and discuss a tentative theory of the temperature
dependence of the hardness.

Several recent studies on the temperature dependence of hardness serve to illustrate the
behavior. Results for GaAs and Ge are shown in Figure 9. The (111) and (100) faces of GaAs
have been examined by Hirsch et al. (1985) and Guruswamy et al. (1986), respectively. Results
for the Knoop hardness on the (100) face of n-type Ge are also shown (Roberts et al., 1986). The
(100) face of GaAs and the Ge show a definite temperature dependence, with a relatively
temperature-independent region for T < 450 K and an exponential temperature dependence for
T > 550 K:

H = H0 exp(U/kT) ,  (149)


with 


U  =  0.24eV  (150) 

for  (100)  GaAs.  The  results  for  GaAs  (111)  appear  to  follow  a  similar  behavior. 
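
To give a feel for the exponential regime, the following sketch compares the hardness at two temperatures, reading Eq. (149) as H = H0 exp(U/kT) with the activation energy U = 0.24 eV quoted for (100) GaAs. The prefactor H0 cancels in the ratio. The two temperatures are arbitrary illustrative choices above 550 K, and the function name is hypothetical.

```python
import math

K_B_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K


def hardness_ratio(t_low, t_high, u_ev=0.24):
    """H(t_low) / H(t_high) for H = H0 * exp(U / kT); H0 cancels in the ratio."""
    return math.exp(u_ev / K_B_EV_PER_K * (1.0 / t_low - 1.0 / t_high))


ratio = hardness_ratio(600.0, 800.0)
print(f"H(600 K) / H(800 K) = {ratio:.2f}")
```

Under these assumptions the hardness falls by roughly a factor of three between 600 K and 800 K, which conveys how steep the high-temperature branch is compared with the nearly flat region below 450 K.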
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The  temperature  dependence  of  hardness  suggests  that  two  different  mechanisms  may 
determine hardness in the two temperature regimes. At low temperature, the hardness is nearly
independent of temperature and may be limited primarily by dislocation interactions. Dislocation
mobility is low at low temperatures, and the tendency for dislocation pile-up is high. At elevated
temperature, the dislocation mobility is increased, so that dislocations move more readily
under an applied stress. Therefore, at higher temperatures, dislocation pile-up is reduced and the
hardness is limited by lattice friction, which shows a strong temperature dependence. We are
currently investigating a quantitative theory of the temperature dependence of hardness, which
also includes the dissimilar velocities of the α and β dislocations.
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V.  CONCLUDING  REMARKS 


The experimental methods available to measure elastic constants vary greatly in their
accuracy and in the size of samples required. Generally, those that measure the velocity of sound
are quite accurate, some yielding elastic constants to one part in 10^. These methods are also
capable of measuring higher order elastic constants, a subject not treated in this paper. However,
the samples required for these measurements must be large, of the order of several centimeters,
and must be perfect bulk single crystals. Many semiconductors, alloys in particular, are only
grown as thin films on disparate substrates. In these examples, the velocity of sound methods
fail, and the less accurate Raman and Brillouin scattering techniques become the methods of
choice. Their accuracy is about 1 to 4%, which is adequate for many practical applications.

Most group IV, III-V compound, and II-VI compound semiconductors have been studied,
and their elastic constants tabulated. A few remain to be examined, and several should be
reexamined because different experimenters do not agree on the results. The situation in the
pseudobinary alloys is quite different. Few alloy systems have been adequately studied; those studied
have mostly fallen into the class of materials in which the bond lengths of the constituents nearly
match. More interesting results are expected from alloys with a bond length mismatch. Such
studies would yield a wealth of information on mechanisms responsible for correlations in these
alloys, and perhaps even on those responsible for producing the ordered alloys that have been
grown recently.

We have emphasized the utility of various parameterized models for treating nonideal
situations. However, the most powerful new theoretical developments are in the area of
first-principles theories. The advent of self-consistent local density theories (Hohenberg et al., 1964) over
twenty years ago, and the advances in methods to solve the Schrödinger equation, are making
real inroads (Methfessel and Van Schilfgaarde, 1990).

Just a few years ago, it took many hours of supercomputer time to solve problems, so
only a few existence proofs had been done to demonstrate the accuracy of local density analysis.
The new LMTO solution methods (Methfessel and Van Schilfgaarde, 1990; Harris, 1985;
Foulkes and Haydock, 1989; Van Schilfgaarde et al., 1991), and the Car and Parrinello extensions
(1985) of APW, increase solution speeds into the realm where it is practical to attack many
mechanical property problems. Moreover, LDA has now been extended to include many-body
corrections (Hybertsen and Louie, 1987), so its few percent deviations from experiments directed
at properties sensitive to total energies should soon be corrected. Conduction bands also are
corrected by the addition of the many-body effects, so optical and transport properties can also be
predicted with accuracy. Surfaces, disordered alloys, and other situations where symmetry is
lost continue to be a challenge, but molecular dynamic techniques, Green's function methods,
and variants of the coherent potential approximation show promise of providing adequate
solutions. As computational speeds continue to increase, these methods will evolve into practical
engineering tools.
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FOOTNOTES 


1. We assume that the angles defining the orientation of axes in the crystal are not specified. For
a further discussion, see Landau and Lifshitz (1986).

2. The accuracy depends upon the correct choice for the resonant frequency of the transducer,
which can be obtained from the fact that it is 1/2 wavelength in thickness. Using a micrometer,
and the wave velocity of the transducer material, one can calculate the resonant frequency
of the transducer. Transducer off-resonant conditions can have a relatively large
influence on the measurement of round-trip time for pulsed studies.

3. Other influences on the measurement of round-trip time are bond thickness and diffraction.
These are discussed in various papers, including McSkimin (1961) and Papadakis (1967,
1972).

4. Information on piezoelectric transducers is available from Valpey-Fisher Corp., 75 South
Street, Hopkinton, MA 01748, and Crystal Technology Inc., 1060 E. Meadow Circle, Palo
Alto, CA 94303.

5. A film sold under the tradename KYNAR is an example of this. Information on this material
can be obtained from Pennwalt Corp., Box C, King of Prussia, PA 19406-0018. Some of the
properties are covered in Bloomfield et al. (1978). Lead attachment to KYNAR films is
covered in Scott and Bloomfield (1981).

6. For example, suppliers of damped ultrasonic transducers can supply material suitable for high
temperature and shear measurements. Two companies are Panametrics, Inc., 221 Crescent
St., Waltham, MA 02254, and Harisonics, Inc., 7 Hyde St., Stamford, CT 06907.

7. Much of this section is adapted from "Final Report" (AFOSR-F49620-85-0023) by M.A.
Berding (1988), SRI International, Menlo Park, CA.
FIGURE CAPTIONS


1. Distortion of a tetrahedron corresponding to the C44 elastic constant; u is the internal
displacement between the anion and cation sublattices.

2. Labels of the interactions between hybrids associated with an adjacent anion-cation pair.
V1a (V1c) is the interaction between two hybrids on the same anion (cation); the "covalent
energy" V2 is the interaction between anion and cation hybrids that point toward one
another along the bond direction; Vj is the interaction between an anion hybrid in one
direction and an adjacent cation hybrid pointing in a different direction. The "ionic energy"
V3 is half the difference between cation and anion term values. The lower segment of the
figure depicts the splitting of the hybrid energy levels by the V2 and V3 interactions.

3. A schematic picture of the hybrids, treated as rigid, in a shear distortion leading to C11 -
C12.

4. List of possible crystal structures for ordered semiconductor alloys (Wei and Zunger,
1989).

5. Near neighbor bond lengths (GaAs and InAs) in the Ga1-xInxAs alloy as a function of
composition x, measured by EXAFS (Mikkelsen and Boyce, 1982).

6. Cluster populations relative to those in a random alloy xj - xj for clusters with nj = 0, 1, 2,
3, 4 B atoms for a Ga1-xInxAs alloy equilibrated at 600 K.

7. Atom configurations during the slip of a dislocation.

8. Schematic of the dislocation potential as a function of its position.

9. Measured hardness of Ge and several GaAs samples as a function of temperature.
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Figure  2 
Table 1. Comparison between calculated and experimental constants.(a)

                 Expt(e)    FP-LMTO(b)   PP-PW(c)   TB

Si
  a              5.431      5.41         5.45       5.431
  B              9.923      9.9          9.3        9.923
  C11 - C12      10.274     10.2         9.8        10.274
  C44            8.036      8.3          8.5        8.013
  C44(0)                                 11.1       11.30
  ζ              0.54(d)    0.51         0.53       0.51
  ω              523        518          521        572

Ge
  a              5.65                    5.59       5.65
  B              7.653                   7.2        7.653
  C11 - C12      8.189                   8.5        8.189
  C44            6.816                   6.3        6.84
  C44(0)                                 7.7        9.46
  ζ                                      0.44       0.49
  ω              303                     302        342

GaAs
  a              5.642                   5.55       5.642
  B              7.69                    7.3        7.69
  C11 - C12      6.63                    7.0        6.63
  C44            6.04                    6.2        5.79
  C44(0)                                 7.5        7.83
  ζ                                      0.48       0.50
  ω              273                     268        292

a. Comparison between calculated and experimental lattice constant a, elastic constants B, C11 - C12 and
C44, Kleinman (1962) internal distortion parameter ζ, and the TO optical phonon ω in wave numbers 1/cm.
Also listed are C44(0), defined in Eq. (13). The FP-LMTO and PP-PW are the ab initio theories described in
Part I, and TB is the tight-binding theory in Part III, Section 10. All elastic constants are in units of
10^11 dynes/cm².

b. Methfessel et al. (1989)

c. Nielsen and Martin (1985a)

d. Cousins et al. (1987)

e. All the experimental lattice constants are those tabulated by Zallen (1982). The experimental elastic
constants are taken from Table 3, and the phonon frequencies are from Table 5.
Table 2. Accuracies of measurement techniques.
Table 3. Experimental values for cubic semiconductors.(a)

Material    C11       C12      C44      s       α         β        IK     IM     IBOM
C(b)        107.640   12.520   57.740   0.0     129.100   84.573   1.00   1.00   1.02
Si(b)       16.772    6.498    8.036    0.0     49.247    13.951   1.00   1.00   1.13
Ge(b)       13.112    4.923    6.816    0.0     39.438    11.583   1.08   1.08   1.05
AlSb(b)     8.769     4.341    4.076    1.684   33.768    6.653    1.11   1.05   1.08
GaP(b)      14.390    6.520    7.143    3.815   46.965    10.448   1.12   1.05   1.08
GaAs(b)     12.110    5.480    6.040    2.827   40.895    9.159    1.12   1.06   1.05
GaSb(b)     9.089     4.143    4.440    1.569   33.123    7.412    1.10   1.06   1.06
InP(b)      10.220    5.760    4.600    3.766   41.095    6.250    1.20   1.07   1.03
InAs(b)     8.329     4.526    3.959    2.820   33.744    5.531    1.22   1.11   1.00
InSb(b)     6.918     3.788    3.132    1.372   29.909    4.951    1.17   1.11   1.05
ZnS(c)      9.420     5.680    4.360    6.788   37.026    4.571    1.33   1.07   0.95
ZnS(c)      10.790    7.220    4.120    6.788   45.126    4.341    1.28   1.02   1.01
ZnS(c)      9.810     6.270    4.483    6.788   39.947    4.300    1.42   1.13   0.90
ZnS(c)      10.460    6.530    4.630    6.788   41.880    4.828    1.33   1.08   0.95
ZnSe(b)     8.95      5.39     3.984    4.368   34.432    4.716    1.28   1.09   0.98
ZnSe(c)     8.59      5.06     4.06     4.368   34.519    4.673    1.32   1.13   0.95
ZnSe(c)     8.720     5.240    3.920    4.368   35.469    4.603    1.29   1.10   0.98
ZnTe(c)     7.130     4.070    3.120    2.566   29.976    4.452    1.18   1.06   1.05
ZnTe(c)     7.220     4.090    3.080    2.566   30.204    4.558    1.14   1.03   1.08
CdTe(b)     5.33      3.65     2.04     3.105   27.058    2.455    1.34   1.07   0.98
CdTe(c)     6.150     4.300    1.960    3.105   31.546    2.731    1.16   0.94   1.13
HgTe(b)     5.971     4.154    2.259    2.381   30.300    2.542    1.37   1.16   0.96
HgTe(c)     5.63      3.66     2.11     2.381   26.919    2.542    1.37   1.15   0.95

a. Experimental elastic constants for some cubic semiconductors and the parameter s of Eq. (81) taken
from Martin (1970), with the force constants α and β obtained from Eqs. (82) and (83) and the identity
relations IK, IM, and IBOM given by Eqs. (78), (88), and (119), respectively. The elastic constants C are in
units of 10^11 dynes/cm², and the remainder are in 10^3 dynes/cm.

b. Data quoted from "Landolt-Börnstein Numerical Data and Functional Relationships in Science and
Technology," New Series, Vols. 17 and 22.

c. Listed in the review by Mitra and Massa (1982).
Table 4. Values for the term values used in the present calculations and Mann's (1967) Hartree-Fock
values as used by Harrison (1980).

             Present               Mann
Element      εs        εp          εs        εp
Cu           -7.72     -2.96       -7.72     -2.37
Ag           -7.57     -3.54       -7.06     -2.61
Au           -9.22     -3.91       -6.98     -2.67
Be           -9.32     -5.41       -8.41     -5.79
Mg           -7.62     -2.97       -6.88     -3.84
Zn           -9.39     -4.09       -7.96     -4.02
Cd           -8.99     -4.17       -7.21     -3.99
Hg           -10.43    -4.35       -7.10     -3.95
B            -14.00    -8.30       -13.46    -8.43
Al           -11.78    -5.98       -10.70    -5.71
Ga           -13.23    -5.90       -11.55    -5.67
In           -12.03    -5.56       -10.14    -5.37
C            -19.81    -11.26      -19.37    -11.07
Si           -15.03    -8.15       -14.79    -7.58
Ge           -16.40    -7.75       -15.15    -7.33
Sn           -14.53    -7.03       -13.04    -6.76
Pb           -15.25    -6.45       -12.48    -6.53
N            -26.08    -14.54      -26.22    -13.84
P            -19.62    -10.57      -19.22    -9.54
As           -20.02    -9.93       -18.91    -8.98
Sb           -17.56    -8.77       -16.02    -8.14
O            -28.55    -13.56      -34.02    -16.72
S            -21.16    -10.39      -24.01    -11.60
Se           -21.41    -9.90       -22.86    -10.68
Te           -19.12    -9.32       -19.12    -9.54
F            -36.23    -17.44      -42.78    -19.86
Cl           -25.81    -13.05      -29.19    -13.78
Br           -24.95    -12.01      -27.00    -12.43
I            -21.95    -10.79      -22.34    -10.97
Table 5. Experimental Properties of Semiconductors.(a)

Material    d        Ebond    B         C         C44       ω
C           1.540    -3.68    44.227    95.120    57.740    1332
Si          2.352    -2.32    9.923     10.274    8.036     520
Ge          2.450    -1.94    7.653     8.189     6.816     301
AlP         2.367    -2.13    8.600     6.900     6.150     440
AlAs        2.451    -1.89    7.727     7.160     5.420     361
AlSb        2.656    -1.76    5.817     4.428     4.076     366
GaP         2.360    -1.78    9.143     7.780     7.143     367
GaAs        2.448    -1.63    7.690     6.630     6.040     269
GaSb        2.640    -1.48    5.792     4.946     4.440     231
InP         2.541    -1.74    7.247     4.460     4.600     304
InAs        2.622    -1.55    5.794     3.803     3.959     219
InSb        2.805    -1.40    4.831     3.130     3.132     185
ZnS         2.342    -1.59    7.637     3.990     4.558     279
ZnSe        2.454    -1.29    6.457     3.560     3.984     213
ZnTe        2.637    -1.20    5.090     3.060     3.120     177
CdTe        2.806    -1.10    4.210     1.680     2.040     141
HgTe        2.798    -0.81    4.759     1.817     2.259     116

a. Values of bond length d, bond energy Ebond, bulk modulus B, and shear coefficient C = C11 - C12 used
to determine the parameters in Tables 7 through 11. Also listed are the experimental values of C44 and
the TO optical phonon mode ω at Γ, to be compared with the calculations. All the elastic constants are in
units of 10^11 dynes/cm², d in Å, Ebond in eV, and ω in terms of wave numbers in 1/cm.
All bond lengths are deduced from the lattice constants quoted by Zallen (1982). The values of Ebond
are taken from Harrison (1980), Table 7-3, except for AlSb, ZnTe, CdTe, and HgTe, which are deduced from
the Phillips (1973) Table 8.2. The elastic constants are taken from Table 3, and the phonon frequencies
are taken from values compiled in "Landolt-Börnstein Numerical Data and Functional Relationships in
Science and Technology," New Series, edited by K.-H. Hellwege, Vols. 17 and 22.
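
The bulk moduli and shear coefficients in Table 5 follow from the C11 and C12 entries of Table 3 through the standard cubic-crystal relations B = (C11 + 2 C12)/3 and C = C11 - C12. The short sketch below, with illustrative function names, checks this for three of the Landolt-Börnstein rows. It is offered as a consistency check on the tabulated numbers, not as part of the original analysis.

```python
def bulk_modulus(c11, c12):
    """Bulk modulus of a cubic crystal, B = (C11 + 2*C12) / 3."""
    return (c11 + 2.0 * c12) / 3.0


def shear_coefficient(c11, c12):
    """Shear coefficient C = C11 - C12, as used in Table 5."""
    return c11 - c12


# (C11, C12) in units of 10^11 dynes/cm^2, copied from Table 3
table3 = {
    "Si": (16.772, 6.498),
    "Ge": (13.112, 4.923),
    "GaAs": (12.110, 5.480),
}

for name, (c11, c12) in table3.items():
    b = bulk_modulus(c11, c12)
    c = shear_coefficient(c11, c12)
    print(f"{name}: B = {b:.3f}, C = {c:.3f}")
```

For these three materials the computed values agree with the B and C columns of Table 5 to the three decimals given there.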


Table 6. Comparison of BOM and BS Models' Predictions.(a)

            Ebond             B                 C                 C44
Material    BOM      BS       BOM      BS       BOM      BS       BOM      BS
C           -7.17    -7.08    49.31    47.39    67.15    67.10    44.45    51.78
Si          -2.39    -2.50    4.65     4.18     7.19     8.07     5.10     5.29
Ge          -2.03    -2.34    2.89     1.91     4.72     6.58     3.69     3.00
AlP         -2.59    -2.54    3.81     4.21     4.45     4.83     3.45     4.10
AlAs        -2.22    -2.20    3.12     3.66     3.85     4.07     2.91     3.50
AlSb        -1.64    -1.62    1.99     2.47     2.75     3.12     2.11     2.54
GaP         -2.32    -2.16    3.63     4.40     4.48     5.15     3.69     4.34
GaAs        -1.99    -1.85    2.83     3.76     3.75     4.30     3.02     3.66
GaSb        -1.53    -1.43    1.76     2.45     2.63     3.40     2.17     2.65
InP         -2.21    -2.06    2.35     2.91     2.69     2.93     2.21     2.63
InAs        -1.89    -1.74    1.93     2.67     2.38     2.52     1.90     2.32
InSb        -1.43    -1.28    1.28     1.97     1.81     2.17     1.48     1.87
ZnS         -1.83    -1.79    3.46     3.74     3.53     3.62     2.71     3.30
ZnSe        -1.52    -1.48    2.67     3.06     2.75     2.67     2.08     2.53
ZnTe        -1.15    -1.05    1.82     2.16     1.88     1.87     1.50     1.78
CdTe        -1.06    -0.97    1.24     1.47     1.22     1.14     0.96     1.13
HgTe        -0.72    -0.49    1.23     1.72     1.33     1.31     1.11     1.30

a. Comparison of the tight-binding theory using the full band structures (BS) and the bond orbital model
(BOM) for bond energies Ebond, bulk moduli B, and shear coefficients C = C11 - C12 and C44. The
energies are in eV and elastic constants in 10^11 dynes/cm².


Table 7. Full band structure calculation.(a)

Material    f        n         m        u0        C44       ζ        ω
C           1.390    2.840     3.767    21.924    48.393    0.121    1459
Si          1.326    3.040     5.001    6.938     8.013     0.511    572
Ge          1.388    3.204     5.278    6.415     6.841     0.487    342
AlP         1.294    3.530     5.598    6.435     5.827     0.516    447
AlAs        1.464    3.524     5.430    7.089     5.598     0.459    384
AlSb        1.337    3.268     5.668    4.838     3.944     0.564    354
GaP         1.395    3.705     5.683    7.285     6.857     0.501    382
GaAs        1.397    3.633     5.716    6.530     5.791     0.500    292
GaSb        1.431    3.471     5.717    5.519     4.515     0.536    256
InP         1.323    4.240     6.633    5.603     4.260     0.584    304
InAs        1.300    3.997     6.427    4.962     3.564     0.552    220
InSb        1.353    3.773     6.399    4.350     3.092     0.602    200
ZnS         1.062    3.308     5.996    4.225     3.727     0.632    325
ZnSe        1.134    3.420     5.994    4.260     3.164     0.576    233
ZnTe        1.284    3.3096    5.828    4.285     2.813     0.590    205
CdTe        1.171    3.656     6.761    3.092     1.701     0.694    156
HgTe        1.173    3.760     7.074    3.080     2.040     0.716    152

a. The results for the parameters f, n, m, and u0 obtained from the fitting of the bond energy, bond
length, bulk modulus, and shear coefficient C11 - C12 of Table 5 using the full band structure calculation.
Also listed are the calculated C44, internal displacement parameter ζ, and the TO optical phonon mode ω
at Γ. All the elastic constants are in units of 10^11 dynes/cm², u0 is in eV, and ω is given in wave numbers in
1/cm.
Table 8. Two sets of TB parameters.(a)


Si 


4 

4 

4 

4 

< 

m 

• 

Q 

Vspo 

Vppo 

Vppn 

Chadi 

0.0 

7.20 

0.0 

7.20 

-2.03 

2.55 

4.55 

-1.09 

Present 

0.0 

6.88 

0.0 

6.88 

-2.41 

2.59 

4.05 

-1.15 

Uo 

ui 

U2 

C 

C44 

c<®> 

''44 

c 

CD 

Chad 

7.29 

•9.98 

23.90 

10.66 

7.89 

11.38 

0.49 

620 

Present 

6.93 

-9.70 

23.42 

10.27 

7.83 

11.39 

0.51 

592 

GaAs 

4 

4 

4 

4 

Vmo 

yAC 

’'spo 

CA 

spa 

Vppo 

VppK 

Chad 

0.0 

9.64 

5.12 

11.56 

-1.70 

2.40 

1.90 

3.44 

-0.89 

Present 

0.0 

10.09 

6.79 

14.12 

•2.34 

2.52 

2.52 

3.94 

-1.12 

uo 

Ui 

U2 

C 

C44 

c(o) 

*^44 

c 

(0 

Chad 

5.12 

-7.12 

18.22 

6.36 

5.60 

8.77 

0.54 

339 

Present 

6.53 

•8.39 

19.90 

6.63 

5.70 

8.53 

0.54 

322 

a. Comparison between the two different sets of TB parameters described in the text, the resultant
expansion coefficients u0, u1, and u2 of the repulsive pair energy u, and the predicted elastic constants,
Kleinman internal displacement parameters ζ, and phonon frequency ω from the Chadi fitting scheme.
Table 9. BOM(1) calculations.(a)

Material    f        n        m        u0        C44       ζ        ω
C           1.440    2.896    3.809    22.345    61.098    0.142    2103
Si          1.356    3.208    5.166    7.023     9.729     0.447    695
Ge          1.395    3.666    5.854    6.383     8.240     0.506    416
AlP         1.047    3.334    5.842    4.418     6.627     0.460    571
AlAs        1.264    3.530    5.685    5.538     6.668     0.431    491
AlSb        1.179    3.170    5.831    3.818     4.482     0.511    411
GaP         1.154    3.439    5.744    5.239     7.500     0.453    496
GaAs        1.179    3.337    5.647    4.863     6.340     0.456    358
GaSb        1.274    3.354    5.797    4.456     4.950     0.500    302
InP         0.999    3.579    6.490    3.413     4.558     0.522    388
InAs        0.995    3.167    5.960    3.081     3.787     0.496    260
InSb        1.126    3.228    6.105    3.091     3.267     0.543    224
ZnS         0.692    2.750    7.262    1.642     4.312     0.580    401
ZnSe        0.750    2.823    6.839    1.821     3.685     0.544    287
ZnTe        0.888    2.715    6.484    1.987     3.156     0.531    248
CdTe        0.734    2.720    8.394    1.p18     1.949     0.660    189
HgTe        0.732    2.202    8.810    0.842     2.140     0.677    177

a. The results for the parameters f, n, m, and u0 obtained from the fitting of the bond energy, bond
length, bulk modulus, and shear coefficient C11 - C12 of Table 5 using the BOM(1) described in the text.
Also listed are the calculated C44, internal displacement parameter ζ, and the TO optical phonon mode ω
at Γ. All the elastic constants are in units of 10^11 dynes/cm², u0 is in eV, and ω is given in terms of wave
number in 1/cm.
Table 10. BOM(2) calculations.(a)

Material    f        n        m        u0        C44       ζ        ωTO
C           1.440    2.896    3.809    22.345    55.161    0.319    1672
Si          1.356    3.208    5.166    7.023     7.520     0.652    597
Ge          1.395    3.666    5.854    6.383     6.106     0.720    358
AlP         1.283    3.440    5.493    6.171     5.094     0.677    511
AlAs        1.472    3.543    5.419    6.979     5.103     0.640    440
AlSb        1.342    3.249    5.580    4.771     3.256     0.706    373
GaP         1.371    3.540    5.490    6.901     5.916     0.688    428
GaAs        1.387    3.455    5.452    6.306     4.864     0.681    317
GaSb        1.414    3.398    5.593    5.305     3.681     0.717    264
InP         1.307    3.846    6.115    5.339     3.446     0.744    354
InAs        1.298    3.528    5.731    4.834     2.806     0.706    243
InSb        1.342    3.464    5.910    4.204     2.351     0.747    207
ZnS         1.062    3.102    5.728    4.075     3.016     0.726    373
ZnSe        1.141    3.150    5.587    4.181     2.577     0.688    271
ZnTe        1.283    3.067    5.498    4.140     2.270     0.704    231
CdTe        1.178    3.271    6.216    3.011     1.319     0.764    177
HgTe        1.169    3.046    6.116    2.916     1.457     0.793    162

a. The results for the parameters f, n, m, and u0 obtained from the fitting of the bond energy, bond
length, bulk modulus, and shear coefficient C11 - C12 of Table 5 using the BOM(2) described in the text.
Also listed are the calculated C44, internal displacement parameter ζ, and the TO optical phonon mode ω
at Γ. All the elastic constants are in units of 10^11 dynes/cm², u0 is in eV, and ω is wave number in 1/cm.
Table 11. BOM(3) calculations.(a)

Material    f        n        m        u0        C44       ζ        ω
C           1.336    2.901    3.872    20.814    47.691    0.135    1531
Si          1.262    3.251    5.346    6.484     7.472     0.580    562
Ge          1.302    3.886    6.264    5.942     6.171     0.658    333
AlP         1.193    3.43     5.577    5.722     4.936     0.541    452
AlAs        1.369    3.559    5.507    6.498     4.812     0.490    387
AlSb        1.249    3.253    5.669    4.402     3.248     0.619    343
GaP         1.276    3.550    5.575    6.422     5.738     0.542    373
GaAs        1.290    3.445    5.511    5.855     4.724     0.549    281
GaSb        1.318    3.415    5.704    4.923     3.670     0.621    239
InP         1.210    3.812    6.153    4.929     3.398     0.604    304
InAs        1.199    3.456    5.724    4.442     2.755     0.579    216
InSb        1.245    3.411    5.943    3.871     2.367     0.657    188
ZnS         0.968    3.062    5.848    3.618     3.120     0.630    343
ZnSe        1.040    3.097    5.671    3.727     2.611     0.580    247
ZnTe        1.178    3.019    5.573    3.371     2.301     0.601    212
CdTe        1.065    3.198    6.368    2.621     1.403     0.690    165
HgTe        1.059    2.937    6.262    2.523     1.563     0.730    154

a. The results for the parameters f, n, m, and u0 obtained from the fitting of the bond energy, bond
length, bulk modulus, and shear coefficient C11 - C12 of Table 5 using the BOM(3) described in the text.
Also listed are the calculated C44, internal displacement parameter ζ, and the TO optical phonon mode ω
at Γ. All the elastic constants are in units of 10^11 dynes/cm², u0 is in eV, and ω is wave number in 1/cm.
Table 12. Ordered alloys: TB elastic constant calculations.(a)

                                                 ΔB/B̄ x 100
Alloy       Bch      Bca      Bcp      B̄        Ch        Ca        Cp
AlGaAs      7.695    7.693    7.689    7.709    -0.18     -0.20     -0.25
AlGaP       8.858    8.858    8.854    8.872    -0.15     -0.15     -0.20
GaInSb      5.226    5.202    5.156    5.312    -1.61     -2.07     -2.92
AlInAs      6.705    6.691    6.661    6.761    -0.83     -1.03     -1.48
InGaAs      6.610    6.579    6.508    6.742    -1.96     -2.42     -3.47
InAlP       7.876    7.860    7.774    7.924    -0.61     -0.08     -1.88
GaInP       8.077    8.035    7.878    8.195    -1.44     -1.95     -3.87
GaAsP       8.328    8.291    8.294    8.417    -1.05     -1.50     -1.46
GaAsSb      6.314    6.198    6.157    6.741    -8.34     -8.05     -8.66
GaPSb       6.584    6.297    6.188    7.468    -11.84    -15.68    -17.14
HgCdTe      4.470    4.472    4.471    4.485    -0.33     -0.28     -0.31
HgZnTe      4.890    4.887    4.632    4.925    -0.71     -0.76     -5.93
CdZnTe      4.611    4.604    4.338    4.650    -0.85     -1.00     -6.72

a. Bulk moduli of ordered alloys calculated using the full TB band-structure method described in Part III,
Section 10. The three structures are chalcopyrite (Ch), CuAuI (Ca), and CuPt (Cp) types. B̄ is the average
value of the constituent compounds, and ΔB = B - B̄.
Table 13. Ordered alloys: Bulk moduli.ᵃ

              GaAs    GaSb    Ga2AsSb                Ga4As3Sb        Ga4AsSb3
Structure     ZB      ZB      CA      CH      CP     LU      FA      LU      FA
B             7.46    5.18    6.10    5.92    5.96   6.52    6.58    5.40    5.31
ΔB/B̄ × 100                    -3.5    -6.3    -5.7   -5.4    -4.5    -6.1    -7.7

a. Calculated bulk moduli for GaAs, GaSb, and GaAsSb ordered alloys by Ferreira et al. (1989), and the
corresponding percentage deviation from the concentration-weighted average. The structures are
zincblende (ZB), CuAu-I (CA), chalcopyrite (CH), CuPt (CP), luzonite (LU), and famatinite (FA).
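
The percentage deviations in Table 13 can be recomputed from the tabulated bulk moduli. The short Python sketch below does this, assuming that the concentration-weighted average is the linear average of the GaAs and GaSb values weighted by the As and Sb fractions on the anion sublattice. The function and variable names are illustrative only and do not come from the original work.

```python
B_GAAS = 7.46  # bulk modulus of GaAs from Table 13
B_GASB = 5.18  # bulk modulus of GaSb from Table 13


def deviation_percent(b_alloy, x_sb):
    """Percentage deviation of b_alloy from the concentration-weighted average.

    x_sb is the Sb fraction on the anion sublattice (assumed linear weighting).
    """
    b_avg = (1.0 - x_sb) * B_GAAS + x_sb * B_GASB
    return 100.0 * (b_alloy - b_avg) / b_avg


# (compound, structure, Sb fraction, bulk modulus), all taken from Table 13
ROWS = [
    ("Ga2AsSb", "CA", 0.50, 6.10),
    ("Ga2AsSb", "CH", 0.50, 5.92),
    ("Ga2AsSb", "CP", 0.50, 5.96),
    ("Ga4As3Sb", "LU", 0.25, 6.52),
    ("Ga4As3Sb", "FA", 0.25, 6.58),
    ("Ga4AsSb3", "LU", 0.75, 5.40),
    ("Ga4AsSb3", "FA", 0.75, 5.31),
]

if __name__ == "__main__":
    for name, structure, x_sb, b in ROWS:
        print(f"{name:9s} {structure}  {deviation_percent(b, x_sb):+.1f}")
```

Under this assumption the recomputed values agree with the tabulated deviations to within rounding.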
Table 14. Bulk moduli of ordered binary alloys AB of the diamond semiconductors A and B from theories
and experiment.

SiC          PP-PWᵃ    experiment
B(C)         50.3      44.23
B(Si)        9.53      9.92
B(SiC)       23.4      22.4
ΔB/B̄ (%)     -21       -17

SiGe         PP-PWᵃ    PP-PWᵇ    ASAᶜ      FP-LMTOᶜ
B(Si)        9.53      9.8       8.80      9.58
B(Ge)        7.75      7.7       6.25      7.05
B(SiGe)      8.73      8.7       7.38      8.31
ΔB/B̄ (%)     1         0         2         0

a. Martins and Zunger (1986)

b. Qteish and Resta (1988)

c. van Schilfgaarde (1990)
Table 15. Measured elastic constants in 10^11 dynes/cm^2 of SiGe alloys by Bublik et al. (1974).

Alloy            C11           C12           C44
Si0.28Ge0.72     16.1 ± 0.8    8.35 ± 0.8    8.55 ± 0.4
Si0.54Ge0.46     17.0 ± 0.8
Si0.64Ge0.36     17.1 ± 0.8
Table 16. Calculated Peierls stress and hardness for various zincblende semiconductors, with
experimental hardness values for comparison.ᵃ

         U_b      Peierls stress   H_P     H_int   H_P + H_int   H_exp
C        11.35    23,300           1940    9244    11,184        1
Si       5.95     3430             286     1098    1384          1370
Ge       6.67     3440             286     893     1179          1000
Sn       5.66     1930             161
AlP      6.04     3410             284     -       -
GaP      6.15     3500             292     903     1195          940
InP      5.78     2640             220     548     768           520
AlAs     5.90     3000             250     -       -             505
GaAs     6.03     3000             256     750     1006          580
InAs     5.64     2340             195     469     664           430
AlSb     5.08     2840             237     524     761           400
GaSb     5.36     2180             182     553     735           450
InSb     4.92     1670             139     365     504           230
ZnS      5.14     3000             250     515     765           -
CdS      4.77     2750             229     288     517           -
HgS      5.68     2590             216     -       -             -
ZnSe     4.92     2500             209     462     671           137
CdSe     4.59     1890             157     254     411           -
HgSe     4.523    1840             154     232     386
ZnTe     4.42     1720             143     374     517           82
CdTe     4.01     1350             113     222     335           60
HgTe     3.99     1360             113     230     343           25

a. The U values are in eV, and the others in kg/mm^2.
A Green's-function method has been used to study the surface and interface electronic structures
of the II-VI compounds HgTe and CdTe. Localized surface and resonance states near the
cation-terminated (100) surface of CdTe and the anion-terminated surface of HgTe have been found for
the ideal surfaces. The energies and strengths of these surface states are altered by surface
perturbations. The bulk states near the surface are drastically modified by the creation of the
surface, but the band gaps remain unchanged. Numerical evaluation of the local densities of states at
the Γ and J points shows that, at the (100) interface of HgTe/CdTe, the previously observed surface
states are no longer present. However, in the interface region, bulk states of one material penetrate
some distance into the other material.


I. INTRODUCTION

One of the earliest formulations used in the investigation of surfaces and interfaces was based on
the linear combination of atomic orbitals (LCAO) method. Goodwin¹ first applied the LCAO model to
study the existence conditions for localized Tamm² states in a crystal. He found that these states
occur when the diagonal Coulomb integrals and the off-diagonal resonance integrals of the surface
are allowed to be different from those of the bulk, for systems of single-level or of sp-hybridized
atoms. At the same time, Shockley³ investigated a periodic potential that is terminated at its
maximum and found that, under the condition that the bulk bands crossed, surface states exist in the
middle of the band gap. Shockley states, as they have come to be known, come into being when the
perturbations at the surface are small in comparison to the widths of the allowed energy bands.
Koutecky⁴ and others⁵ generalized Goodwin's model to study the energy and existence conditions of
surface states. Electronic interface states were studied within the LCAO method by Davison and
Cheng.⁶ An exact tight-binding solution for the surface and interface problems of a one-dimensional
semiconductor was obtained by Dy and co-workers.⁷,⁸

Since these early investigations, many other papers have appeared in the literature for the study
of surface and interface properties of solids. Kalkstein and Soven (KS) (Ref. 9) introduced a
Green's-function (GF) formalism to study the surface electronic properties of semi-infinite
crystals. This is a relatively simple but powerful method by which both the bulk and surface
properties of a semi-infinite crystal can be studied. The method can be generalized in a
straightforward manner to study the electronic properties of an interface formed by joining two
semi-infinite crystals. Because of its simplicity and power the method of KS was widely applied in
many calculations during the decade following its development.¹⁰⁻¹⁴ For the same reasons we apply
this method in this work to investigate surface and interface electronic structures of the pure
II-VI compounds CdTe and HgTe.¹⁵,¹⁶

Despite the intense interest in Hg1-xCdxTe over the past three decades due to its application in
the preparation of infrared detectors, there have been relatively few experimental and theoretical
investigations of the surface and interface properties of this system.¹⁷⁻¹⁹ Since the KS method is
well suited for the study of the surface properties of a system described by a tight-binding
Hamiltonian, we use here a Hamiltonian closely related to that obtained by Hass, Ehrenreich, and
Velicky (HEV) (Ref. 20) for HgTe and CdTe in the empirical tight-binding approximation (ETBA) based
on the LCAO interpolation scheme of Slater and Koster.²¹ In the ETBA the predicted band structures
of the pure compounds HgTe and CdTe are matched to experimentally determined band energies.²²

In  Sec.  II,  we  introduce  the  formalism  of  KS  for  the 
description  of  the  surface  properties  of  II-VI  materials 
along  with  the  extension  of  the  technique  to  the  problem 
of  interfaces  of  these  materials.  Section  III  contains  the 
results  of  our  calculation  and  a  discussion. 

II. FORMALISM

In the formalism of Kalkstein and Soven,⁹ a pair of semi-infinite crystals is formed by introducing
a cleavage plane into an infinite crystal in one crystallographic direction. A GF describing the
electronic properties of the semi-infinite systems is derived from the GF of the infinite crystal
and a scattering potential representing the cleavage. When combined with a tight-binding formalism
in which nearest- and next-nearest-neighbor interactions are included, the scattering potential is
relatively easy to calculate, making application to realistic systems simple. The Hamiltonian is
constructed, following HEV, for HgTe and CdTe using sp³ basis states and including spin. The
parameters used are those of Slater and Koster²¹ as determined by HEV (Ref. 20) except that, in our
case, the values of Esx(110) and Esx(011) are interchanged. This produces band structures that are
qualitatively similar to those of Bryant, which offer improved conduction-band mass. We calculate
the GF, G, associated with this Hamiltonian by using the defining equation

$$ G = (E + ik - H)^{-1} , \qquad (1) $$

where H is the Hamiltonian of the infinite unperturbed system. It is understood that G is to be
calculated in the limit as the positive imaginary part k approaches zero. These calculations are
performed in $\mathbf{k}$ space, utilizing fully the periodicity of the crystal.

©1989 The American Physical Society
J. T. SCHICK, S. M. BOSE, AND A.-B. CHEN

For the surface (interface) calculations, since we no longer have translational periodicity along
the direction perpendicular to the surface (interface), we cannot use an ordinary k-space
representation. Kalkstein and Soven⁹ assume periodic structure parallel to the surface and use a
representation consisting of states which are localized on planes of atomic sites parallel to the
surface and represented by the index n, and of Bloch states reflecting the periodic symmetry within
the planes which are represented by the index k∥. This is the mixed or Bloch-Wannier
representation. We assume the same type of symmetry in the interface system. The Hamiltonian and
the GF of the bulk crystal as well as the Hamiltonians of the semi-infinite and interface systems
must all be expressed in this mixed representation. The formulas for the semi-infinite system were
derived by KS and are the same for us if we reinterpret the algebraic expressions as matrix
equations for the sp³-spin basis states. Note that in this paper we examine the surfaces and
interfaces perpendicular to the [100] direction for pure CdTe and HgTe samples. For these
compounds, this structure corresponds to an arrangement in which the anions and cations are placed
in alternating planes parallel to the surface or interface. For notational simplicity, in the
following, a cation-anion pair of planes is given a single layer index n, with the species index ν
left to distinguish between the two species (layers).

For basis states of the infinite system, KS used states analogous to |ασνk⟩, where α = s, x, y, z
indicates the atomic orbital, the spin (↑, ↓) is represented by σ, ν = a or c stands for anions or
cations, respectively, and k is the wave vector. Following KS, we write the Bloch-Wannier states
for our system as

$$ |\alpha\sigma\nu n\mathbf{k}_\parallel\rangle = \frac{1}{\sqrt{N}} \sum_{k_\perp} e^{-i k_\perp R_{n\nu}} \, |\alpha\sigma\nu\mathbf{k}\rangle , \qquad (2) $$

where R_nν is the position of the ion sublattice represented by ν on the plane labeled by n, and k∥
and k⊥ are the components of k parallel and perpendicular to the surface (interface) plane,
respectively. This basis set reflects the symmetry of the semi-infinite system and is therefore
well suited for our purpose. In the following we suppress the k∥ index for compactness of notation,
as in

$$ G(n,n') = \sum_{\alpha\sigma\nu} \sum_{\alpha'\sigma'\nu'} \langle \alpha\sigma\nu n\mathbf{k}_\parallel | G | \alpha'\sigma'\nu' n'\mathbf{k}_\parallel \rangle \; |\alpha\sigma\nu n\mathbf{k}_\parallel\rangle \langle \alpha'\sigma'\nu' n'\mathbf{k}_\parallel | , \qquad (3) $$

which is the GF submatrix between layer n and layer n'. From the site-diagonal GF, G(n,n), we may
calculate the local density of states

$$ \rho_{n\nu}(E) = -(1/\pi)\, \mathrm{Im}\, \mathrm{Tr}_\nu\, G(n,n) , \qquad (4) $$

where Tr_ν indicates that the trace is to be carried out only for the species (layer) given by ν.
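
As an illustration of how a local density of states follows from a Green's function evaluated in k space with a small positive imaginary part in the energy, the sketch below treats a hypothetical one-band, nearest-neighbor chain. This toy model, its parameters (`e0`, `t`, `eta`, `n_k`) and the function names are assumptions made for the example; it is not the sp³ Hamiltonian with spin used in this work.

```python
import math


def bulk_green_diag(energy, eta=0.05, e0=0.0, t=1.0, n_k=2000):
    """Site-diagonal element of G = (E + i*eta - H)^(-1) for a one-band chain.

    The chain has the band e0 + 2*t*cos(k); the k sum runs over n_k equally
    spaced points of the one-dimensional Brillouin zone.
    """
    z = complex(energy, eta)
    total = 0j
    for j in range(n_k):
        k = 2.0 * math.pi * j / n_k
        total += 1.0 / (z - e0 - 2.0 * t * math.cos(k))
    return total / n_k


def ldos(energy, **kwargs):
    """Local density of states, -(1/pi) Im G(n,n), in the spirit of Eq. (4)."""
    return -bulk_green_diag(energy, **kwargs).imag / math.pi


if __name__ == "__main__":
    for e in (-3.0, -1.0, 0.0, 1.0, 3.0):
        print(f"E = {e:+.1f}  LDOS = {ldos(e):.4f}")
```

For a single orbital per site the trace in Eq. (4) reduces to one matrix element. A finite `eta` broadens the band edges at E = e0 ± 2t, so the LDOS is small but nonzero just outside the band.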

The GF of the semi-infinite crystal, G', is found through the application of the Dyson equation,

$$ G' = G + G V G' . \qquad (5) $$

Besides calculating the matrix elements of the infinite crystal GF, G, we must also find the matrix
elements of the scattering potential, V, introduced by the creation of the surface. We label the
double layers in the crystal with integers such that the cleavage plane passes between the double
layers -1 and 0, and the semi-infinite crystal of interest starts at the zeroth layer and has layer
indices n ≥ 0. Because G'(n,n') is zero for all n and n' less than zero, the only matrix elements
of the scattering potential that enter into Eq. (5) in the present model are V(-1,0) and V(0,-1),
which express the severing of the interactions across the cleavage plane. We may also include a
diagonal term V(0,0), if we wish, to allow for surface perturbations such as relaxation and
environmental shifts in our calculation. These scattering potentials have explicit dependence on
k∥, and the pair of indices associated with V refer to the double layers involved in the
interaction. In terms of the Hamiltonian matrices, the scattering potential matrix is given by

$$ V = H' - H , \qquad (6) $$

where H' is the Hamiltonian of the semi-infinite crystal. The scattering potential matrix elements
describing the breaking of interactions across the cleavage plane are

$$ V(0,-1) = -H(0,-1) , \qquad V(-1,0) = -H(-1,0) , \qquad (7) $$

where, as before, the k∥ index has been suppressed for compactness of notation. The matrix element
V(0,0) is introduced parametrically to account for the shifts in the atomic levels and hopping
interaction at the surface layer.

Explicitly, a general matrix element of G' is found from Eq. (5) as

$$ G'(n,n') = G(n,n') + [\, G(n,0)\,V(0,0) + G(n,-1)\,V(-1,0) \,]\, G'(0,n') , \qquad (8) $$

with n and n' ≥ 0. To solve this equation, it is first necessary to solve for G'(0,n'), which is
done by setting n equal to zero in Eq. (8) and solving for the matrix element

$$ G'(0,n') = [\, 1 - G(0,0)\,V(0,0) - G(0,-1)\,V(-1,0) \,]^{-1}\, G(0,n') . \qquad (9) $$

To find the general matrix element of G' we need only to substitute Eq. (9) into Eq. (8). To
include environmental effects on the surface atoms, we introduce a parameter δ which measures a
shift of the surface layer on-site energies. Also, we introduce a parameter d which represents the
strength of the interactions between the first two planes of atoms in the semi-infinite crystal
relative to the same interactions in the bulk. The parameters δ and d are included in the matrices
V(0,0) and V(-1,-1).
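
The structure of Eq. (9) can be checked on a hypothetical one-band chain, where every matrix reduces to a number. In the sketch below, which is an assumption-laden toy and not the sp³ calculation of this work, the infinite-chain GF is taken in its closed form G(n,n') = G(0,0) q^|n-n'|, the severed bond is V(-1,0) = -t, and the environmental shift enters as V(0,0) = δ. The names `bulk_g`, `surface_g` and `surface_ldos` are illustrative.

```python
import cmath
import math


def bulk_g(z, e0=0.0, t=1.0):
    """Return (G(0,0), q) for the infinite chain, with G(n,n') = G(0,0) * q**|n-n'|."""
    x = z - e0
    s = cmath.sqrt(x * x - 4.0 * t * t)
    q = (x - s) / (2.0 * t)
    if abs(q) > 1.0:  # keep the root that decays away from the source site
        s = -s
        q = (x - s) / (2.0 * t)
    return 1.0 / s, q


def surface_g(z, delta=0.0, e0=0.0, t=1.0):
    """Scalar form of Eq. (9) with n' = 0 for a chain cut between sites -1 and 0."""
    g00, q = bulk_g(z, e0, t)
    g0m1 = g00 * q   # G(0,-1)
    v_m1_0 = -t      # V(-1,0) = -H(-1,0): the severed hopping
    v_00 = delta     # V(0,0): on-site shift of the surface atom
    return g00 / (1.0 - g00 * v_00 - g0m1 * v_m1_0)


def surface_ldos(energy, eta=1e-3, **kwargs):
    """Surface-layer LDOS, -(1/pi) Im G'(0,0)."""
    return -surface_g(complex(energy, eta), **kwargs).imag / math.pi


if __name__ == "__main__":
    print("ideal surface, band center:", surface_ldos(0.0))
    print("delta = 2t, E = 2.5t      :", surface_ldos(2.5, delta=2.0))
```

With δ = 0 the result reduces to the known surface GF q/t of the semi-infinite chain. In this toy model a shift |δ| > |t| pulls a localized state out of the band at E = e0 + δ + t²/δ, which is the one-band analogue of the sensitivity of the bound state to the surface parameters.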

This model was first extended to the interface between two model semi-infinite one-band crystals by
Yaniv. We further extend the technique to real crystals using the sp³ with spin basis states. The
interface is formed between crystal A in (n ≤ -1) and crystal B in (n ≥ 0). In joining the
crystals, the interactions between the Te and Cd planes, as well as those between the Te planes,
across the interface, are taken to be the same as in the bulk. Since there are no data available
for the hopping integrals between Hg and Cd planes we assume them to be equal to the average of the
interactions between bulk-crystal (100) Cd planes in CdTe and interactions between bulk-crystal
(100) Hg planes in HgTe. The interactions between the A and B sides of the interface are included
in the scattering potentials V'(0,-1) and V'(-1,0). To our Green's function we add subscripts α and
β which take on the values A and B to indicate the side of the interface involved in the
calculations. Once again solving the Dyson equation for the interface GF, g (Ref. 16),

$$ g = G' + G' V' g , $$

we find the interface GF matrix elements as

$$ g_{\alpha\beta}(m,n) = G'_{\alpha}(m,n)\,\delta_{\alpha\beta} + G'_{B}(m,0)\,V'(0,-1)\,g_{A\beta}(-1,n)\,\delta_{\alpha B} + G'_{A}(m,-1)\,V'(-1,0)\,g_{B\beta}(0,n)\,\delta_{\alpha A} , \qquad (10) $$

where

$$ g_{AA}(-1,n) = [\, 1 - G'_{A}(-1,-1)\,V'(-1,0)\,G'_{B}(0,0)\,V'(0,-1) \,]^{-1}\, G'_{A}(-1,n) , \qquad (11a) $$

$$ g_{BB}(0,n) = [\, 1 - G'_{B}(0,0)\,V'(0,-1)\,G'_{A}(-1,-1)\,V'(-1,0) \,]^{-1}\, G'_{B}(0,n) , \qquad (11b) $$

$$ g_{BA}(0,n) = G'_{B}(0,0)\,V'(0,-1)\,g_{AA}(-1,n) , \qquad (11c) $$

$$ g_{AB}(-1,n) = G'_{A}(-1,-1)\,V'(-1,0)\,g_{BB}(0,n) , \qquad (11d) $$

and δ_αβ is the Kronecker delta. We have assumed a perfect match in the lattice spacing across the
interface, which is nearly true for HgTe and CdTe with a difference in spacing of only 0.3%.
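
A quick consistency check on Eq. (11b) is that joining two identical semi-infinite crystals with the bulk coupling must give back the infinite-crystal GF. The sketch below performs this check for two hypothetical one-band chains, using the closed-form surface GF q/t of such a chain. The model, the coupling `t_ab` standing in for V'(0,-1) = V'(-1,0), and the function names are assumptions for illustration only.

```python
import cmath


def decay_factor(z, e0, t):
    """Root q with |q| < 1 of z - e0 = t * (q + 1/q) for a one-band chain."""
    x = z - e0
    s = cmath.sqrt(x * x - 4.0 * t * t)
    q = (x - s) / (2.0 * t)
    if abs(q) > 1.0:
        q = (x + s) / (2.0 * t)
    return q


def surface_g(z, e0, t):
    """Surface Green's function q/t of a semi-infinite one-band chain."""
    return decay_factor(z, e0, t) / t


def interface_g_bb(z, e_a, e_b, t, t_ab):
    """Scalar form of Eq. (11b) with n = 0: the GF on the first B-side site."""
    g_a = surface_g(z, e_a, t)  # G'_A(-1,-1)
    g_b = surface_g(z, e_b, t)  # G'_B(0,0)
    return g_b / (1.0 - g_b * t_ab * g_a * t_ab)


if __name__ == "__main__":
    z = complex(0.4, 1e-3)
    q = decay_factor(z, 0.0, 1.0)
    print("interface GF, identical chains:", interface_g_bb(z, 0.0, 0.0, 1.0, 1.0))
    print("infinite-chain GF             :", q / (1.0 - q * q))
```

Choosing different on-site energies `e_a` and `e_b` then gives a simple picture of band states from one side leaking into the gap region of the other.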

In the evaluation of the GF of the bulk system a great deal of numerical integration is required.
The integrals are evaluated through the use of a Lagrange interpolation scheme in which the time to
calculate the band structures is reduced by evaluating them exactly at only a few points in k space
and interpolating for the intermediate values. Accuracy is ensured through sampling the functions
at a sufficient number of points. All calculations are performed with a small positive imaginary
component in the energy and the final results at the real energy axis are evaluated through the use
of the analytic continuation procedure of Hass, Velicky, and Ehrenreich.
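
The idea of evaluating a band exactly at a few k points and interpolating in between can be sketched in one dimension with a plain Lagrange polynomial. The example below interpolates the hypothetical band E(k) = 2 cos k from seven exact samples; the actual interpolation grid and order used in this work are not specified here, so the node choice and the function name are assumptions.

```python
import math


def lagrange_interpolate(xs, ys, x):
    """Value at x of the Lagrange polynomial through the points (xs[i], ys[i])."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        weight = 1.0
        for j, xj in enumerate(xs):
            if j != i:
                weight *= (x - xj) / (xi - xj)
        total += yi * weight
    return total


if __name__ == "__main__":
    # Exact band energies at a few k points of a model band E(k) = 2 cos k.
    nodes = [i * math.pi / 6.0 for i in range(7)]
    energies = [2.0 * math.cos(k) for k in nodes]
    for k in (0.3, 1.0, 2.5):
        approx = lagrange_interpolate(nodes, energies, k)
        print(f"k = {k:.1f}  interpolated = {approx:+.5f}  exact = {2.0 * math.cos(k):+.5f}")
```

The interpolation error is controlled by the number of sample points, which is the sense in which accuracy is ensured by sampling at a sufficient number of points.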

One quite useful feature of our calculation is that the evaluation of the GF's of the
semi-infinite crystal and the interface requires much less computational effort than the
evaluation of the infinite crystal GF. As a result of this speed, we are able to consider several
values of the environmental shift and surface-nearest-neighbor interaction. In order to observe the
effects of the variation of these parameters on the localized states we examine the local densities
of states (LDOS) at various values of the parameters for a few CdTe and HgTe surfaces and
interfaces. These LDOS are evaluated at fixed values of k∥ so that we may find the positions of the
localized state bands in the surface Brillouin zone. For details of the evaluation of the matrix
elements of the infinite crystal GF and those of the scattering potentials required for our
calculation, one is referred to Refs. 15 and 16.


FIG. 1. Dependence of the LDOS on the parameters δ and d as compared to the infinite crystal for
(a) the cation surface layer and (b) the first interior anion layer at Γ (k∥ = 0) for CdTe
cation-terminated (100) surface. [Axis label: Energy (eV).]
III. RESULTS AND DISCUSSION

Once the matrix elements of the infinite crystal GF and the relevant scattering potentials are
specified, one can calculate the matrix elements of the GF of the semi-infinite sample and that of
the sample with the interface using Eqs. (8) and (10), respectively. One can further calculate the
LDOS at various layers using an equation of the type of Eq. (4). In this paper we present the LDOS
calculated at the symmetry points Γ and J and at energies near the valence- and the conduction-band
edges of the sample where the principal band gaps occur.

In Figs. 1 and 2 we plot the surface LDOS at the Γ point for various combinations of the
environmental shift parameter δ and the geometric shift parameter (relaxation parameter) d for CdTe
cation-terminated and HgTe anion-terminated crystals, respectively. We begin by noting, in Fig. 1,
the existence of a localized surface state (bound state) which is the sharp structure that appears
in the LDOS within the band gap of the CdTe, whether or not there is a surface perturbation. The
ideal cation-truncated (100) surface of CdTe has a surface state at Γ with energy lying at
E = 0.6 eV above the top of the valence band. We also see that the bulk densities of states are
modified as a result of the introduction of a surface but the band gap of CdTe remains unchanged.
We note that the position of the bound state is sensitive to variations in both d and δ. In Fig. 2,
we notice that the surface has again introduced a marked change in the LDOS of HgTe. A resonance
state appears in the form of a spike inside the conduction band of semimetal HgTe. The contribution
of this resonance to the LDOS is strongest for anion layers, indicating that anion states are the
most likely constituents of this state. Similar surface states have been calculated by Bryant for
the special case where no surface perturbations exist.

FIG. 2. Dependence of the LDOS on the parameters δ and d as compared to the infinite crystal for
(a) the anion surface layer and (b) the first interior cation layer at Γ (k∥ = 0) for HgTe
anion-terminated (100) surface.

FIG. 3. Dependence of the LDOS on the parameters δ and d as compared to the infinite crystal for
(a) the anion surface layer and (b) the first interior cation layer at J [k∥ = (π/a)(√2, √2)] for
HgTe anion-terminated (100) surface.

The surface states calculated for the HgTe at the J point where there exists a gap are shown in
Fig. 3. These states exhibit the same dependence on the interactions that was seen at the Γ point.
Here instead of a single peak we see a pair of bound states just above the valence band within the
band gap. Also above the conduction band there is a bound state that was not observed for the Γ
point. Anions contribute more strongly to the bound states inside the band gap while the cation
contribution is stronger to the bound state above the conduction band. Also notice that the bulk
states are strongly modified by the introduction of the surface and that the band gaps are not
altered by the surface.

FIG. 4. Comparison of the surface LDOS of the cation-terminated (100) surface of CdTe with the LDOS
plotted (a) at successive cation layers and (b) anion layers for the CdTe side of the (100)
interface CdTe/HgTe at Γ.

Our calculation of the layer dependence of the surface states shows that these states become
progressively less significant as we examine deeper inside the crystal, indicating these states are
indeed localized bound states. The bulk LDOS in the deeper layers, on the other hand, start
resembling the infinite crystal LDOS.

As mentioned before, the interface calculations have been performed assuming no lattice mismatch in
the crystal structures of the HgTe and CdTe compounds, thus neglecting all strains that may be
present at the interface. The parameters chosen for our calculation also do not allow for
valence-band offset. With these assumptions, dramatic effects are seen at the interface of HgTe and
CdTe as shown in Figs. 4 and 5, where the LDOS in the CdTe side of the interface are presented at
the Γ and J points, respectively. The most obvious effect is the lack of localized states that were
previously seen at the free surfaces of these materials. Instead, we find that, close to the
interface, the bulk states of HgTe that lie in the region between 0 and 1.6 eV appear in the band
gap of CdTe at the Γ point. These states in the CdTe gap become less significant at layers further
from the interface while at the same time the LDOS in the energy ranges of the conduction and
valence bands of CdTe increase in strength. Thus we have a narrowing of the gap of CdTe in a
limited region near the interface which allows conduction electrons to penetrate into the CdTe from
the HgTe over a short distance. In Fig. 5, we explicitly see how the gap at the CdTe layers is
reduced with respect to pure CdTe at J. Calculation of the LDOS in the HgTe side of the interface
shows similar effects.

FIG. 5. Comparison of the surface LDOS of the cation-terminated (100) surface of CdTe with the LDOS
plotted (a) at first cation layer at the interface and (b) the first interior anion layer for the
CdTe side of the (100) interface CdTe/HgTe at J.

The above results describe how the LDOS change from the bulk to the surface and then from the
surface to the interface. While the bulk and the interface results can be considered realistic, the
surface results may not be, since the surface reconstruction has not been included in the
calculation. Recent experiments and a structural theory indicate that the surfaces of CdTe and some
other II-VI compounds undergo reconstructions similar to those on the surfaces of III-V compounds.
However, similar measurements are yet to be made on HgTe surfaces and CdTe/HgTe interfaces. Our
calculations indicate that changes in the electronic structure in the form of environmental shifts
at the surface lead to only minor changes in the localized surface and the bulk LDOS. Whether a
surface reconstruction will result in substantial modification of these states is yet to be
determined.

In conclusion, we have described in this paper how one can incorporate the GF method of KS to
calculate the surface and interface structures of II-VI compounds without having to deal with the
artificial super-cell approximation. Our calculation shows that drastic changes in the LDOS can
occur because of creation of surfaces and interfaces. Localized surface or resonance states appear
above the top of the valence band, and the effects of the surface on the LDOS persist at several
layers inside the bulk. At the interface, there are no localized states but the bulklike states
related to one material penetrate into several layers inside the other material, resulting in
narrowing of band gaps close to the interface for the wide band-gap material. At present there are
no systematic experimental data available for comparison with our conclusions.

Finally, even though we have not included all aspects necessary for a complete description of the
surfaces and interfaces, our work can be considered to be the first step toward the understanding
of the surface and interface electronic structures of the II-VI compounds. To our knowledge, the
results presented here are new for the interface and more general than any previous surface
calculations on the II-VI compounds. Since the method is flexible, it will be possible to
incorporate realistic structural models dealing with the surface reconstructions, when they become
available, to calculate more realistic electronic properties of such surfaces and interfaces.
IV ALLOY STATISTICS AND PHASE DIAGRAMS


4.1 Mixing Free Energy, Miscibility Gap and Order-Disorder Transitions

Consider an ideal pseudobinary semiconductor alloy A1-xBxC in the
zincblende structure in which the alloy atoms A and B randomly occupy
their fcc sublattice sites while the C atoms occupy the other fcc
sublattice. If the C atoms are treated as spectators, then this
pseudobinary alloy behaves like a fcc binary alloy A1-xBx in the
statistical mechanics formalism. However, as evidenced by the EXAFS
experiment discussed in Chapter I, the atomic positions in a real
semiconductor alloy are distorted slightly from the zincblende sites.
Because the bonding is covalent the C atoms mediate the energies and in
that sense affect the statistics. It will become clear later that the
distribution of A and B atoms in an alloy is never completely random.
This has important consequences on many physical properties. For
example, a knowledge of structural energies and associated atomic
distributions is essential to accurate calculations of phase diagrams.

In  this  chapter  we  will  be  dealing  with  equilibrium  statistics  and  phase 
diagrams.  The  equilibrium  state  at  a  fixed  temperature  T  and  pressure  P 
is  the  one  with  the  minimum  Gibbs  free  energy.  The  equilibrium  state  may 
contain  only  one  phase,  for  example,  an  ordered  alloy  in  a  given  crystal 
structure,  or  a  disordered  solution  with  a  uniform  concentration.  It  may 
also  contain  several  phases,  for  example,  an  ordered  alloy  plus  a 
disordered  one,  or  two  disordered  phases  with  different  concentrations. 
Gibbs'  phase  rule  (e.g.,  Landau  and  Lifshitz,  1986)  states  that  a  solution 
containing  n  species  can  have  up  to  a  maximum  of  n  +  2  phases  coexisting 
in  equilibrium.  Thus  to  determine  the  equilibrium  state  at  constant 
temperature  and  pressure  of  an  alloy  requires  knowledge  of  the  Gibbs  free 
energies  of  all  possible  phases. 

Under normal pressures of the order of one atmosphere, the difference
between the Gibbs free energy $G$ and the Helmholtz free energy $F$,
$G - F = PV$, is insignificant for a solid or liquid. Thus, it is
sufficient to use $F$ for most cases. For a disordered zincblende
pseudobinary alloy $A_{1-x}B_xC$, it is convenient to define a mixing free
energy $\Delta F$ as a function of $x$ and $T$ at a fixed pressure,

$$\Delta F(x,T) = F(x,T) - (1-x)\,F_{AC}(T) - x\,F_{BC}(T), \tag{4.1.1}$$

where $F$, $F_{AC}$ and $F_{BC}$ are respectively the Helmholtz free
energies for the alloy, pure AC and pure BC compounds containing the same
number of C atoms. $\Delta F$ can be written as

$$\Delta F = \Delta E - T\,\Delta S, \tag{4.1.2}$$

where $\Delta E$ is the mixing energy and $\Delta S$ is the mixing entropy,
defined similarly to $\Delta F$. Again, because the magnitude of
$\Delta(PV)$ is small, $\Delta E$ and the mixing enthalpy $\Delta H$ are
used interchangeably. If $\Delta E > 0$ at $T = 0$, which turns out to be
the case for most semiconductor alloys, then $\Delta F$ as a function of
$x$ at different fixed $T$ will have the schematic shapes of the curves
shown in Fig. 4.1.1a. The curve labelled $T_3 > T_c$ represents a typical
high temperature curve. It tells us that disordered alloys with any
concentration fraction $x$ are stable at this temperature. However, the
low-temperature curve labelled $T_2 < T_c$ indicates there is a miscibility
gap for $x$ within the interval $x_1 < x < x_2$. Here $x_1$ and $x_2$ are
the points at which the common tangent line touches the $\Delta F$ curve.
This curve means that the alloy is thermally stable against decomposition
for $x \le x_1$ and $x \ge x_2$ but not inside the interval
$x_1 < x < x_2$. For $x$ inside the gap, the alloy tends to decompose into
two alloys with concentrations $x_1$ and $x_2$ with proportions $P_1$ and
$P_2$ governed by the lever rule, $P_1 = (x_2 - x)/(x_2 - x_1)$ and
$P_2 = (x - x_1)/(x_2 - x_1)$, because this decomposition minimizes the
free energy. There are two other special concentration values $x'_1$ and
$x'_2$ called the spinodal points. These occur at inflection points of
$\Delta F$ at each $T < T_c$, i.e., $d^2\Delta F/dx^2 = 0$ at these $x$
values. These values separate $x$ space into unstable and metastable
regions. For $x_1 < x < x'_1$ or $x'_2 < x < x_2$, the alloy is metastable
against local decomposition, because the $\Delta F$ value for any $x$ in
these regions is lower than the lever rule average value of $\Delta F$ when
the alloy begins to separate into domains with compositions in the
neighborhood of $x$. This causes a barrier to the alloy decomposition into
its equilibrium state. An alloy with $x'_1 < x < x'_2$ is inherently
unstable because there is no decomposition barrier. We can construct the
binodal and spinodal curves by continuously varying the temperature and
tracing the values of these gaps, as shown in Fig. 4.1.1b. These gap values
become closer to each other as the temperature increases until a critical
temperature $T_c$ is reached, where all the values $x_1$, $x_2$, $x'_1$,
and $x'_2$ merge into one value. Beyond $T_c$, the disordered phase is
stable for all alloy concentrations.

Figure 4.1.1 (a) Schematic picture of mixing free energies $\Delta F$ as a
function of composition $x$ of a pseudobinary alloy $A_{1-x}B_xC$ with a
positive mixing enthalpy at three temperatures $T_1 = 0$, $T_2 < T_c$ and
$T_3 > T_c$. (b) Schematic miscibility gap and spinodal curves. Regions I,
II, and III are stable, metastable and unstable respectively.
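The lever rule quoted above is simply a statement of mass balance, which a few lines of Python make explicit. This is a minimal sketch; the function name `lever_rule` and the numerical values are illustrative assumptions, not taken from the text.

```python
def lever_rule(x, x1, x2):
    """Return (P1, P2), the fractions of the phases with compositions x1 and x2.

    Implements P1 = (x2 - x)/(x2 - x1) and P2 = (x - x1)/(x2 - x1)
    for an average composition x inside the miscibility gap.
    """
    if not x1 < x2:
        raise ValueError("expected x1 < x2")
    if not x1 <= x <= x2:
        raise ValueError("x lies outside the miscibility gap")
    p1 = (x2 - x) / (x2 - x1)
    p2 = (x - x1) / (x2 - x1)
    return p1, p2


# Illustrative numbers only: gap edges at 0.10 and 0.90, average x = 0.30.
p1, p2 = lever_rule(0.30, 0.10, 0.90)
print(p1, p2)
```

The two fractions sum to one, and the weighted composition `p1*x1 + p2*x2` reproduces the average `x`, which is all the lever rule asserts.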

If $\Delta E < 0$ at $T = 0$, then there is a tendency to form a long-range
ordered alloy. Whether the alloy is ordered or disordered depends upon the
temperature. Figure 4.1.2 schematically compares the mixing free energy
of the disordered alloy against that of an ordered compound $ABC_2$. With a
negative mixing enthalpy, the energy of the ordered system $\Delta E_o$ is
lower than that of the disordered alloy, i.e., $\Delta E_o < \Delta E$ at
$T = 0$, so that the ordered state is the equilibrium state. As temperature
increases, $\Delta F$ of the disordered alloy becomes more negative, mainly
because the entropy term $-T\Delta S$ in Eq. (4.1.2) decreases roughly
linearly in $T$. The system remains in the ordered phase until a transition
temperature $T_o$ is reached, at which $\Delta F$ is equal to
$\Delta E_o$. The disordered alloy then becomes the stable state for
temperatures greater than $T_o$.

4.2  Analytical  Models 

Since pseudobinary alloys $A_{1-x}B_xC$ statistically are similar to binary
alloys $A_{1-x}B_x$, the binary results will be used whenever they do not
cause any confusion. The statistical theory for binary solid solutions is
mathematically  equivalent  to  the  three-dimensional  Ising  model  in 
magnetism.  Even  for  the  simplest  case  with  nearest-pair  interactions,  an 
analytical  solution  of  this  model  remains  one  of  the  most  challenging 
problems  in  theoretical  physics.  However,  systematic  computational 
methods  are  available,  as  will  be  discussed  later.  This  section  briefly 
reviews  several  analytical  models,  which  are  useful  for  illustrating  the 
basic  concepts  and  for  semi-empirical  phase  diagram  evaluation.  This 
review  will  focus  on  disordered  alloys  with  positive  mixing  energies, 
because  this  is  the  most  relevant  case  for  most  bulk  semiconductors. 
However,  results  related  to  ordered  cases  will  also  be  discussed.  More 
detailed  examples  of  spontaneous  ordering  will  be  considered  in  Chapter  7. 
Figure 4.1.2 Schematic picture of mixing free energies $\Delta F$ as a
function of alloy composition $x$ of a pseudobinary alloy $A_{1-x}B_xC$
with a negative mixing enthalpy ($\Delta E < 0$) at three temperatures
$T_1 = 0$, $T_2 = T_c$ and $T_3 > T_c$. The dotted curve is for an $ABC_2$
ordered phase.


Ideal-solution  Model 


This model is equivalent to the case where the mixing energy vanishes,
$\Delta E = 0$ in Eq. (4.1.2). Thus in this approximation, the mixing free
energy $\Delta F$ results totally from the additional entropy arising from
a random arrangement of $N_A$ A-atoms and $N_B$ B-atoms on their $N$
lattice sites with $N = N_A + N_B$ and $x = N_B/N$. The number of different
arrangements is

$$\Phi_0 = \frac{N!}{N_A!\,N_B!}. \tag{4.2.1}$$

The corresponding mixing entropy is $\Delta S = k\ln\Phi_0$. Using the
Stirling approximation $\ln N! \approx N\ln N - N$, the mixing entropy for
the random alloy is obtained,

$$\Delta S = -Nk\,[(1-x)\ln(1-x) + x\ln x]. \tag{4.2.2}$$

This model always yields negative values of $\Delta F = -T\Delta S$ for
$0 < x < 1$ and $T > 0$. As a function of $x$, $\Delta F$ obtained from
Eq. (4.2.2) is always concave upward, so no miscibility gap exists at
any $T$.
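How good the Stirling step is can be checked numerically. The sketch below compares the exact logarithm of the binomial count of arrangements with Eq. (4.2.2), both expressed per site in units of k. The function names are illustrative assumptions.

```python
import math


def exact_mixing_entropy_per_site(n_sites, n_b):
    """ln[N!/(N_A! N_B!)]/N in units of k, evaluated exactly with log-gamma."""
    n_a = n_sites - n_b
    ln_count = (math.lgamma(n_sites + 1)
                - math.lgamma(n_a + 1)
                - math.lgamma(n_b + 1))
    return ln_count / n_sites


def stirling_mixing_entropy_per_site(x):
    """Eq. (4.2.2) divided by N k: -[(1-x) ln(1-x) + x ln x]."""
    return -((1.0 - x) * math.log(1.0 - x) + x * math.log(x))


# Compare at x = 0.25 for increasing lattice sizes.
for n in (100, 10_000, 1_000_000):
    exact = exact_mixing_entropy_per_site(n, n // 4)
    approx = stirling_mixing_entropy_per_site(0.25)
    print(n, exact, approx, approx - exact)
```

The difference shrinks roughly as ln(N)/N, so for any macroscopic crystal the random-alloy entropy of Eq. (4.2.2) is effectively exact.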


Zeroth Approximation

When $\Delta E$ is not zero but the mixing entropy is still set equal to
the random alloy result, the model was named by Guggenheim (1952) the
"strict-regular solution" model, or the zeroth approximation. If,
following shapes often experimentally observed in pseudobinary alloys,
$\Delta E$ is assumed to have an $x$ dependence given by

$$\Delta E = \nu\,x(1-x)\,\Omega, \tag{4.2.3}$$

then miscibility gaps exist at low temperature for positive molar
interaction parameters $\Omega$. In Eq. (4.2.3), $\nu$ is the molar number
$\nu = N/N_0$ with $N_0$ being Avogadro's number.

Equation (4.2.3), as we shall now demonstrate, is the expression for the
mixing energy if pair-wise interaction energies and a random distribution
are assumed. Let $z$ be the coordination number. Note that $z$ is the
number of nearest neighbor atoms surrounding a site in the binary case, but
it represents the number of second-neighbor atoms in a pseudobinary
alloy. For example, the value of $z$ for a zincblende alloy is 12. Then the
total number of pairs involving the A and B atoms is $M = zN/2$. Let
$M_{AA}$, $M_{AB}$ and $M_{BB}$ be the number of AA, AB, and BB pairs
respectively, and let $\varepsilon_{AA}$, $\varepsilon_{AB}$ and
$\varepsilon_{BB}$ be the corresponding pair interaction energies. We
further define the fractions $r_{ij} = M_{ij}/M$. Not all these pair
fractions are independent. Let $r = r_{AB}$; then $r_{AA} = 1 - x - r/2$
and $r_{BB} = x - r/2$. The mixing energy is governed only by the AB pair
fraction $r$,

$$\Delta E = M_{AA}\varepsilon_{AA} + M_{AB}\varepsilon_{AB} + M_{BB}\varepsilon_{BB} - (1-x)M\varepsilon_{AA} - xM\varepsilon_{BB} = Mr\varepsilon, \tag{4.2.4}$$

where $\varepsilon$ is the excess energy per pair and is defined as

$$\varepsilon = \varepsilon_{AB} - (\varepsilon_{AA} + \varepsilon_{BB})/2. \tag{4.2.5}$$

For a random alloy the pair probability is given by

$$r = 2(1-x)x, \tag{4.2.6}$$

so the mixing energy has the form of Eq. (4.2.3) with the interaction
parameter given by

$$\Omega = N_0 z\varepsilon. \tag{4.2.7}$$

This strict-regular model is the simplest one that contains some aspect
of reality, because it relates a positive mixing enthalpy to a miscibility
gap and critical temperature $T_c$. For example, $T_c$ can be obtained
explicitly by setting $d^2\Delta F/dx^2$ equal to zero at $x = 1/2$,**
where $\Delta F = \Delta E - T_c\Delta S$ with $\Delta E$ in the form of
Eq. (4.2.3) and $\Delta S$ having the random alloy expression in
Eq. (4.2.2). The result is

$$T_c = \frac{\Omega}{2R}, \tag{4.2.8}$$

where $R = N_0 k$ is the universal gas constant.


** $\Delta F$ has its second derivative vanish at $x = 1/2$ in this case
because of the assumed symmetric forms of Eqs. (4.2.2) and (4.2.3). In more
general cases that extend beyond pair interactions, as we shall see, the
second derivative vanishes at values of the composition $x \ne 1/2$.
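The miscibility gap and spinodal of the strict-regular model can be traced numerically. The sketch below works in the reduced temperature t = T/Tc with Tc = Ω/(2R), so that Ω/(RT) = 2/t. Because the model is symmetric about x = 1/2, the common tangent is horizontal and the binodal follows from d(ΔF)/dx = 0. The function names, the bisection solver and the lower bracket value are illustrative choices, not part of the original text.

```python
import math


def spinodal(t):
    """Spinodal points x'_1 < x'_2 at reduced temperature t = T/Tc (0 < t < 1).

    From d^2(ΔF)/dx^2 = 0 with Eqs. (4.2.2) and (4.2.3): x(1 - x) = t/4.
    """
    s = math.sqrt(1.0 - t)
    return 0.5 * (1.0 - s), 0.5 * (1.0 + s)


def binodal(t, tol=1e-12):
    """Binodal points x_1 < x_2 from d(ΔF)/dx = 0 (horizontal common tangent)."""
    def slope(x):
        # d(ΔF/RT)/dx per mole, using Ω/(RT) = 2/t
        return (2.0 / t) * (1.0 - 2.0 * x) + math.log(x / (1.0 - x))

    # slope < 0 near x = 0 and slope > 0 at the lower spinodal point,
    # so the binodal root is bracketed between them.
    lo, hi = 1e-300, spinodal(t)[0]
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if slope(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    x1 = 0.5 * (lo + hi)
    return x1, 1.0 - x1


for t in (0.5, 0.8, 0.95):
    print(t, binodal(t), spinodal(t))
```

At every t < 1 the binodal points lie outside the spinodal points, and both pairs close up toward x = 1/2 as t approaches 1, which is the behavior sketched in Fig. 4.1.1b.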
First Approximation - the Quasi-Chemical Approximation (QCA)

In the zeroth approximation, an energy term is added, but the distribution
of A and B atoms is still constrained to be random, a condition that is not
consistent with the non-zero excess pair energy $\varepsilon$. Intuitively
one expects that the pair probability $r$ for the AB pair should be smaller
than the random value in Eq. (4.2.6) if $\varepsilon$ is positive, and vice
versa. The first approximation in Guggenheim's notation (1952) corrects
this flaw in an approximate way. The correction comes from the mixing
entropy. For a random alloy, the number of ways of configuring the A and B
atoms, $\Phi_0$, is given by Eq. (4.2.1). For a specified set of $M_{AA}$,
$M_{AB}$, and $M_{BB}$, the number of ways to arrange A and B atoms should
be just $\Phi_0$ times the probability of having the specified set of
numbers of pairs (Sher et al. 1987),


$$\Phi = \frac{N!}{N_A!\,N_B!}\left[\frac{M!}{M_{AA}!\,M_{AB}!\,M_{BB}!}\;y^{2M_{AA}}\,(2xy)^{M_{AB}}\,x^{2M_{BB}}\right]. \tag{4.2.9}$$


In Eq. (4.2.9), $y = 1-x$ is the fractional concentration of A atoms. The
term outside the bracket is the total distinguishable number of ways to
arrange $N_A$ and $N_B$ atoms on $N$ sites, and the bracket is the fraction
of these ways, i.e., the probability, that $M_{AA}$, $M_{AB}$, and $M_{BB}$
pairs will be found when the concentration of B atoms is $x$. This
probability is the number of ways of arranging the $M_{AA}$, $M_{AB}$, and
$M_{BB}$ pairs on $M$ pair sites times the probability of $M_{AA}$ pairs,
i.e., $y^{2M_{AA}}$, of $M_{AB}$ pairs, $(2xy)^{M_{AB}}$, and of $M_{BB}$
pairs, $x^{2M_{BB}}$. This expression appears to neglect the correlations
between pairs, but we shall show in Section 4.10 that in fact it does not.
Equation (4.2.9) should be compared with Guggenheim's (1952) combinatorial
formula


$$\Phi = \frac{N!}{N_A!\,N_B!}\;\frac{(Mx^2)!\,(Mxy)!\,(Mxy)!\,(My^2)!}{M_{AA}!\,M_{AB}!\,M_{BA}!\,M_{BB}!}. \tag{4.2.10}$$


Note that the leading terms of the logarithms of Eqs. (4.2.9) and (4.2.10)
in the Stirling approximation are the same, so these two expressions give
the same mixing entropy. However, the physics in Eq. (4.2.9) is more
transparent and it is an easier form to extend beyond the pair
approximation.

The mixing free energy with $\Phi$ given by Eq. (4.2.9) and excess energy
$\varepsilon$ given by Eq. (4.2.5) is

$$\Delta F = Mr\varepsilon - kT\ln\Phi_0 + MkT\,[(x - r/2)\ln(x - r/2) + (y - r/2)\ln(y - r/2) + r\ln(r/2) - 2(x\ln x + y\ln y)]. \tag{4.2.11}$$


The value of $r$ for an equilibrium distribution is the one that minimizes
$\Delta F$. Taking the partial derivative $\partial\Delta F/\partial r$ and
setting it equal to zero leads to a quadratic algebraic equation for $r$.
The proper solution is

$$r = \frac{4xy}{1 + \{1 + 4xy\,[\exp(2\varepsilon/kT) - 1]\}^{1/2}}. \tag{4.2.12}$$


This expression reduces to the correct limits. As $\varepsilon$ approaches
0, $r$ approaches $2xy$, which is the random limit given in Eq. (4.2.6).
For large $\varepsilon \gg kT$, $r$ decreases exponentially,
$r \propto e^{-\varepsilon/kT}$, as expected. If $\varepsilon$ is negative,
then $r > 2xy$, and for large enough $|\varepsilon| \gg kT$ the system is
eventually driven into compound formation, i.e., $r \to 2x$ for $x < 1/2$,
or $r \to 2y$ for $x > 1/2$.
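The closed form for r can be checked against a direct numerical minimization of the QCA free energy. The sketch below evaluates Eq. (4.2.12) and, independently, locates the zero of ∂ΔF/∂r by bisection. Only the r-dependent part of Eq. (4.2.11) matters for this derivative. The function names and the bisection solver are illustrative assumptions.

```python
import math


def qca_pair_fraction(x, eps_over_kt):
    """AB pair fraction r from Eq. (4.2.12); eps_over_kt is ε/(kT)."""
    y = 1.0 - x
    f = math.sqrt(1.0 + 4.0 * x * y * math.expm1(2.0 * eps_over_kt))
    return 4.0 * x * y / (1.0 + f)


def free_energy_slope(r, x, eps_over_kt):
    """(1/MkT) ∂ΔF/∂r obtained by differentiating Eq. (4.2.11) at fixed x."""
    y = 1.0 - x
    return (eps_over_kt + math.log(r / 2.0)
            - 0.5 * math.log((x - r / 2.0) * (y - r / 2.0)))


def minimise_numerically(x, eps_over_kt, tol=1e-13):
    """Bisection on the slope; r is confined to 0 < r < 2*min(x, 1-x)."""
    lo, hi = 0.0, 2.0 * min(x, 1.0 - x)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if free_energy_slope(mid, x, eps_over_kt) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


x = 0.3
for eps in (-2.0, 0.0, 0.5, 3.0):
    print(eps, qca_pair_fraction(x, eps), minimise_numerically(x, eps))
```

The slope is monotonic in r, so the stationary point is the unique minimum. At ε = 0 both routes return the random value 2xy, and for strongly negative ε the pair fraction approaches 2x for x < 1/2, as stated in the text.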

The AB pair probability $r$ in Eq. (4.2.12) can also be obtained by
considering the following chemical equilibrium

$$AA + BB \rightleftharpoons 2AB. \tag{4.2.13}$$

According to the law of mass action, the equilibrium pair fractions obey
the following relation

$$\frac{r_{AB}^2}{r_{AA}\,r_{BB}} = \frac{z_{AB}^2}{z_{AA}\,z_{BB}} = 4e^{-2\varepsilon\beta}, \tag{4.2.14}$$

where $\beta = 1/(kT)$ and the $z_{ij}$ are the partition functions for the
indicated pairs. Equation (4.2.14) reduces to

$$r^2\,(e^{2\varepsilon/kT} - 1) = -2r + 4xy. \tag{4.2.15}$$

The proper solution for $r$ from this equation is that given in
Eq. (4.2.12). It is in this connection that the first approximation is
often referred to as the quasi-chemical approximation (QCA).

QCA represents an improvement over the zeroth approximation because in
it the correlation of the distribution of A and B atoms, called the
correlation state of the alloy, depends on the pair energies. The level of
improvement can be appreciated by comparison with the exact analytical
solutions in two dimensions. The binary alloy statistical problem with
pair interactions is equivalent to the magnetic Ising model. For
$\varepsilon > 0$, the alloy problem can be directly translated into the
ferromagnetic Ising model. The only difference is that while the alloy
concentration $x$ is specified, $x$ in the magnetic problem is further
varied to minimize $\Delta F$.

Thus above $T_c$ and with pair interactions the minimum of $\Delta F$
always occurs at $x = 1/2$, which in the magnetic case means no net spin or
magnetic moment. However, for temperatures below $T_c$, $\Delta F$ has two
minima occurring at $x$ values $x_1$ and $x_2$ corresponding to the
miscibility gaps indicated in Fig. 4.1.1. For magnetism, this situation
corresponds to having a net magnetic moment. Thus the alloy spinodal
critical temperature $T_c$ is equivalent to the magnetic phase transition
critical temperature.

The value of $T_c$ in the pair model can be written as

$$T_c = \frac{\Omega}{\lambda R}, \tag{4.2.16}$$

where $\Omega$ is the interaction parameter $\Omega = N_0 z\varepsilon$
defined in Eq. (4.2.7), and $\lambda$ is a numerical factor that depends on
the model used. For the zeroth approximation we already found
$\lambda = 2$ [see Eq. (4.2.8)]. The zeroth approximation is equivalent to
the Bragg-Williams (1954) approximation to the Ising model. For QCA, the
$\lambda$ value is given by

$$\lambda = z\ln\left(\frac{z}{z-2}\right). \tag{4.2.17}$$

This result can be obtained by starting from the expression for $\Delta F$
in Eq. (4.2.11) and $r$ in Eq. (4.2.12), and then taking the second
derivative $\partial^2\Delta F/\partial x^2$ and setting it equal to zero
at $x = 1/2$. In doing so, the conditions
$\partial\Delta F/\partial r = 0$ and $dr/dx = 0$ at $x = 1/2$ are used. An
explicit derivation of a more accurate expression is given in Sec. 4.7.
Note that QCA for alloys is equivalent to the Bethe-Peierls approximation
(Bethe, 1935; Huang, 1987) to the Ising model.

Table 4.2.1 compares the $\lambda$ values that enter the $T_c$ formula,
Eq. (4.2.16), for a square lattice from different approximations against
the exact solution (Onsager, 1944) and for an fcc lattice against the best
numerical value available (de Fontaine, 1979). The results obtained from
the cluster variation method (CVM) to be discussed in Sec. 4.10 are also
listed for later use. Progressively more sophisticated statistical
theories result in larger $\lambda$ values, and hence smaller critical
temperatures. Although QCA predicts only 79% of the exact $\lambda$ value
in the square lattice case, it gives 89% of the best computer simulated
value for the fcc lattice. Therefore, not surprisingly, QCA has been used
with reasonable success in semi-empirical phase diagram calculations for
semiconductor compounds and alloys, as will be discussed in the next few
sections.

Table 4.2.1
The values of $\lambda$ for the transition temperature
$T_c = \Omega/(\lambda R)$ from different calculations. CVM indicates the
results from the cluster variation method to be discussed in Sec. 4.10.

Alloy,            Zeroth Approx.,    QCA,             CVM      Exact or
Ising Model       Bragg-Williams     Bethe-Peierls             Best Result

Square Lattice    2                  2.773            -        3.5255
fcc               2                  2.188            2.438    2.4501

After de Fontaine (1974), Table V.
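The QCA entries of Table 4.2.1 and the 79% and 89% figures quoted in the text follow directly from Eq. (4.2.17). The short sketch below reproduces them; the function name `qca_lambda` is an illustrative assumption, and the exact and best values are copied from the table.

```python
import math


def qca_lambda(z):
    """λ = z ln[z/(z-2)] from Eq. (4.2.17); z is the coordination number."""
    return z * math.log(z / (z - 2))


square = qca_lambda(4)    # square lattice, z = 4
fcc = qca_lambda(12)      # fcc lattice, z = 12

print(round(square, 3), round(fcc, 3))
# Ratios to the exact (square) and best numerical (fcc) values of Table 4.2.1
print(round(square / 3.5255, 2), round(fcc / 2.4501, 2))
```

For very large z the same formula tends to 2, so the zeroth (Bragg-Williams) value is recovered as the high-coordination limit of QCA.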
4.3 Phase Diagram; Common Tangent Line and Activity Coefficient


So  far  our  calculation  of  free  energies  has  been  confined  to  solid 
solutions.  To  find  the  phase  diagrams  one  also  needs  to  know  the  free 
energies  of  liquids  (or  melts).  Very  often,  liquids  are  treated  as  dense 
lattice  gases,  as  solids  with  a  high  concentration  of  vacancies,  or  simply 
as  a  disordered  system  with  some  effective  coordination  number.  The 
latter  treatment  has  been  used  with  some  success  in  the  calculation  of 
phase diagrams of compound semiconductors and alloys. This and the
following sections introduce the basic ideas governing phase diagrams
including (1) the liquidus curve between a binary liquid $A_{1-x}B_x$ and a
stoichiometric compound AB, e.g., between a $Ga_{1-x}As_x$ melt and
stoichiometric solid GaAs; (2) the liquidus-solidus curves describing the
equilibrium between a ternary liquid and a solid pseudobinary alloy; and
(3) the general ternary phase diagrams. The basic ideas and most results
are in the papers of Vieland (1963), Stringfellow and Green (1969), Kikuchi
(1981 and 1982) and Brebrick et al. (1983), and in a text by Casey and
Panish (1978).

As  mentioned  earlier  the  equilibrium  state  at  a  given  P  and  T  is  the  state 
with  minimum  Gibbs  free  energy.  To  construct  a  phase  diagram,  we  first 
calculate  the  free  energy  curves  as  a  function  of  alloy  concentrations  for 
all  the  relevant  phases,  and  then  find  the  combination  of  phases  that 
minimizes the free energy. For example, consider the liquidus-solidus
curves between a pseudobinary liquid and the solid solutions $A_{1-x}B_xC$
schematically shown in Fig. 4.3.1a. The corresponding free energy curves
for both phases as a function of $x$ for a given temperature $T_1$ are
drawn in Fig. 4.3.1b. Note the common tangent line that touches the two
free energy curves at $x_1$ and $x_2$. This picture illustrates the cause
of the equilibrium state behavior as a function of the average alloy
concentration $x$ at this temperature. For $x$ less than $x_1$ the liquid
phase is the stable phase, and for $x > x_2$ the solid solution is the
stable phase. For an average concentration lying between $x_1$ and $x_2$
the stable state is a mixture of a liquid phase with concentration $x_1$
and a solid phase of concentration $x_2$ with proportions governed by the
lever rule. The two free energy curves at this temperature $T_1$ give two
phase boundary points (denoted by L and S) for the liquidus-solidus curves
shown in Fig. 4.3.1a.
The  full  liquidus-solidus  curves  are  generated  from  a  set  of  such  free 
energy  curves  derived  by  continuously  varying  the  temperature.  This 
procedure  for  minimizing  the  free  energy  using  the  common  tangent  line  is 
equivalent  to  matching  the  chemical  potentials,  as  will  be  demonstrated 
below. 

The Gibbs free energy $G$, or in practice the Helmholtz free energy $F$,
for a given phase can be written as a sum of the products of the number of
particles $N_i$ and the chemical potentials $\mu_i$ of all the atom and
molecular species $i$ in that phase,

$$F \approx G = \sum_i N_i\,\mu_i. \tag{4.3.1}$$

Conversely the chemical potential can be calculated from

$$\mu_i = (\partial F/\partial N_i)_{N_j,\;j \ne i}. \tag{4.3.2}$$

Now consider the equilibrium between two phases, denoted by $\alpha$ and
$\beta$, of a binary alloy $A_{1-x}B_x$ at a given temperature $T$, such as
depicted in Fig. 4.3.1b. Let the common tangent line touch the two free
energy curves $F_\alpha$ and $F_\beta$ at $x_\alpha$ and $x_\beta$
respectively. Denote the chemical potential of an A atom in the $\alpha$
phase, $\mu_A(x_\alpha,T)$, as $\mu_A^\alpha$, and assign similar meanings
to $\mu_B^\alpha$, $\mu_A^\beta$, and $\mu_B^\beta$.

Algebraically, the common tangent line requires

$$\partial F_\alpha(x_\alpha,T)/\partial x = \partial F_\beta(x_\beta,T)/\partial x \tag{4.3.3}$$

and

$$F_\beta(x_\beta,T) = F_\alpha(x_\alpha,T) + (x_\beta - x_\alpha)\,\partial F_\alpha(x_\alpha,T)/\partial x. \tag{4.3.4}$$

In terms of the chemical potentials, we have

$$F_\alpha(x_\alpha,T) = N\,[x_\alpha\mu_B^\alpha + (1 - x_\alpha)\mu_A^\alpha], \tag{4.3.5}$$

$$F_\beta(x_\beta,T) = N\,[x_\beta\mu_B^\beta + (1 - x_\beta)\mu_A^\beta], \tag{4.3.6}$$

$$\partial F_\alpha(x_\alpha,T)/\partial x = N(\mu_B^\alpha - \mu_A^\alpha), \tag{4.3.7}$$

$$\partial F_\beta(x_\beta,T)/\partial x = N(\mu_B^\beta - \mu_A^\beta). \tag{4.3.8}$$

Using these expressions in Eqs. (4.3.3) and (4.3.4) leads to the usual
conditions for chemical equilibrium, $\mu_A^\alpha = \mu_A^\beta$ and
$\mu_B^\alpha = \mu_B^\beta$.
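The step from the common-tangent conditions to equal chemical potentials can be written out explicitly. The following short derivation uses only Eqs. (4.3.3) through (4.3.8); the abbreviation δ for the common slope per atom is introduced here for convenience and is not in the original text.

```latex
\begin{align*}
\text{(4.3.3) with (4.3.7), (4.3.8):}\quad
  & \mu_B^{\alpha}-\mu_A^{\alpha} = \mu_B^{\beta}-\mu_A^{\beta} \equiv \delta ,\\
\text{(4.3.5), (4.3.6):}\quad
  & F_\alpha = N(\mu_A^{\alpha} + x_\alpha\,\delta),\qquad
    F_\beta = N(\mu_A^{\beta} + x_\beta\,\delta),\\
\text{(4.3.4):}\quad
  & N(\mu_A^{\beta} + x_\beta\,\delta)
    = N(\mu_A^{\alpha} + x_\alpha\,\delta) + (x_\beta - x_\alpha)\,N\delta
  \;\Longrightarrow\; \mu_A^{\beta} = \mu_A^{\alpha},\\
\text{and then}\quad
  & \mu_B^{\beta} = \mu_A^{\beta} + \delta = \mu_A^{\alpha} + \delta = \mu_B^{\alpha}.
\end{align*}
```

Equal slopes fix the difference of the two chemical potentials, and the requirement that both tangent points lie on the same line fixes their common offset.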

Very often the activity coefficients $\gamma$ rather than the chemical
potentials $\mu$ are used in phase diagram evaluations. The activity
coefficient $\gamma_i$ is related to the chemical potentials by

$$\mu_i - \mu_i^0 = kT\ln(x_i\gamma_i), \tag{4.3.9}$$

where $\mu_i^0$ is the chemical potential for the pure phase of the $i$th
species. For an ideal solution, i.e., a random mixture, which is always
found in the limit $T \to \infty$, all the activity coefficients become
unity, $\gamma_i = 1$. In the zeroth approximation for a binary
$A_{1-x}B_x$ (or a pseudobinary $A_{1-x}B_xC$), the activity coefficients
can be shown to be given by

$$\gamma_A = e^{\Omega x^2/(RT)}, \tag{4.3.10}$$

$$\gamma_B = e^{\Omega (1-x)^2/(RT)}. \tag{4.3.11}$$
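The zeroth-approximation activity coefficients can be verified by differentiating the strict-regular mixing free energy numerically, following Eqs. (4.3.2) and (4.3.9). In the sketch below everything is expressed in units of kT, with w = Ω/(RT) = zε/(kT), and the closed forms assumed are γ_A = exp(w x²) and γ_B = exp[w (1-x)²]. The function names and the finite-difference step are illustrative assumptions.

```python
import math


def mixing_free_energy(n_a, n_b, w):
    """ΔF/kT of the strict-regular solution for n_a A atoms and n_b B atoms.

    Energy term from Eq. (4.2.3), entropy term from Eq. (4.2.2);
    w = Ω/(RT) = zε/(kT).
    """
    n = n_a + n_b
    return (w * n_a * n_b / n
            + n_a * math.log(n_a / n)
            + n_b * math.log(n_b / n))


def gamma_a_numeric(x, w, n=1.0, h=1e-6):
    """γ_A from a central-difference ∂ΔF/∂N_A at fixed N_B."""
    n_a, n_b = (1.0 - x) * n, x * n
    mu = (mixing_free_energy(n_a + h, n_b, w)
          - mixing_free_energy(n_a - h, n_b, w)) / (2.0 * h)
    # (μ_A - μ_A^0)/kT = ln[(1 - x) γ_A]
    return math.exp(mu) / (1.0 - x)


def gamma_a_closed(x, w):
    """Closed form assumed for the zeroth approximation: exp(w x^2)."""
    return math.exp(w * x * x)


def gamma_b_closed(x, w):
    """Closed form assumed for the zeroth approximation: exp(w (1-x)^2)."""
    return math.exp(w * (1.0 - x) ** 2)


for x in (0.1, 0.3, 0.5):
    print(x, gamma_a_numeric(x, 1.5), gamma_a_closed(x, 1.5))
```

Because the derivative is taken at fixed N_B, the result does not depend on the overall system size n, which is why a single "atom" suffices in the example.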

In QCA, using Eqs. (4.2.11) and (4.2.12), one finds

$$\gamma_A = \left[\frac{f + 1 - 2x}{(1-x)(1+f)}\right]^{z/2}, \tag{4.3.12}$$

$$\gamma_B = \left[\frac{f - 1 + 2x}{x(1+f)}\right]^{z/2}, \tag{4.3.13}$$

where $f$ is related to the excess energy $\varepsilon$ by

$$f = [1 + 4xy\,(e^{2\varepsilon\beta} - 1)]^{1/2}, \tag{4.3.14}$$

with $\beta = 1/(kT)$. These results will be used in the next two sections.
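A quick numerical check of the QCA activity coefficients is sketched below. The expressions coded here are an assumption: they are the Guggenheim-type forms γ_A = (r_AA/y²)^(z/2) and γ_B = (r_BB/x²)^(z/2) obtained by differentiating Eq. (4.2.11) at the equilibrium r of Eq. (4.2.12), written in terms of f from Eq. (4.3.14). The function name is illustrative.

```python
import math


def qca_activity_coefficients(x, eps_over_kt, z):
    """Return (γ_A, γ_B) in QCA for B fraction x, ε/(kT), and coordination z.

    Assumed forms: γ_A = [(f + 1 - 2x) / ((1 - x)(1 + f))]^(z/2)
                   γ_B = [(f - 1 + 2x) / (x (1 + f))]^(z/2)
    with f = [1 + 4xy (exp(2ε/kT) - 1)]^(1/2).
    """
    y = 1.0 - x
    f = math.sqrt(1.0 + 4.0 * x * y * math.expm1(2.0 * eps_over_kt))
    gamma_a = ((f + 1.0 - 2.0 * x) / (y * (1.0 + f))) ** (z / 2.0)
    gamma_b = ((f - 1.0 + 2.0 * x) / (x * (1.0 + f))) ** (z / 2.0)
    return gamma_a, gamma_b


# Ideal limit: ε = 0 gives γ_A = γ_B = 1.
print(qca_activity_coefficients(0.3, 0.0, 12))

# High-coordination limit at fixed w = zε/(kT): approaches exp(w x^2).
w, z = 1.5, 1_000_000
print(qca_activity_coefficients(0.3, w / z, z)[0], math.exp(w * 0.3 ** 2))
```

The two limits printed are the ones the text leads one to expect: unity for an ideal solution, and the zeroth-approximation value when the coordination number is large at fixed Ω.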


4.4 Vieland's Method and a Binary Liquidus

The Vieland (1963) method, which establishes the relation between
chemical potentials of a stoichiometric solid AB compound and those of
the super-cooled liquid solution $A_{0.5}B_{0.5}$ through the entropy of
fusion and heat capacity differences between the two phases, remains the
principal way of calculating phase diagrams between semiconductor solids
and liquids.

Consider the equilibrium between a liquid solution $A_{1-x}B_x$ and a solid
compound AB. A schematic liquidus curve is shown in Fig. 4.4.1a while the
corresponding free energy diagram at a given temperature $T$ is given in
Fig. 4.4.1b. In equilibrium, the chemical potential $\mu_{AB}$ per AB pair
(unit cell) in the AB compound must be equal to the sum of the chemical
potentials in the liquid solution,

$$\mu_{AB} = \mu_A(x_0) + \mu_B(x_0), \tag{4.4.1}$$

where $x_0$ is the point at which the common tangent line touches the
liquid free energy curve shown in Fig. 4.4.1b. The free energy of the solid
compound with $N$ unit cells at this temperature is of course
$F_s = N\mu_{AB}$. $F_s$ can be related to the free energy of a
super-cooled 50-50 liquid solution by following a sequence of
quasi-equilibrium steps. First, heat one mole of an AB crystal from $T$ to
the melting temperature $T_m$, i.e., going from the point marked S to M in
Fig. 4.4.1a. The change in the free energy in this step is


$$F_s(T_m) - F_s(T) = -\int_T^{T_m} S_s(T')\,dT'. \tag{4.4.2}$$

The subscript $s$ denotes the solid phase. The entropy $S_s$ is related to
the heat capacity $C_s$ by

$$S_s(T) = S_s(T_m) - \int_T^{T_m}\frac{C_s(T')}{T'}\,dT'. \tag{4.4.3}$$

Combining the two equations above yields

$$F_s(T_m) - F_s(T) = -S_s(T_m)\,(T_m - T) + \int_T^{T_m}\!\!\int_{T'}^{T_m}\frac{C_s(T'')}{T''}\,dT''\,dT'. \tag{4.4.4}$$

Figure 4.4.1 (a) Schematic liquidus curve for a compound. (b) Free energy
curves as a function of composition $x$.


The second step is to melt the crystal at $T_m$, which causes no change in
the free energy. This situation corresponds to Figure 4.4.1b with the
liquid free energy curve lowered to pass through the solid free energy
minimum labeled S.

Finally the liquid is cooled back from $T_m$ to $T$ (a super-cooled
liquid), which changes the free energy by

$$F_\ell(T) - F_\ell(T_m) = -S_\ell(T_m)\,(T - T_m) - \int_T^{T_m}\!\!\int_{T'}^{T_m}\frac{C_\ell(T'')}{T''}\,dT''\,dT'. \tag{4.4.5}$$

The subscript $\ell$ indicates the liquid. The total change is

$$F_\ell(T) - F_s(T) = [S_\ell(T_m) - S_s(T_m)]\,(T_m - T) - \int_T^{T_m}\!\!\int_{T'}^{T_m}\frac{C_\ell(T'') - C_s(T'')}{T''}\,dT''\,dT'. \tag{4.4.6}$$

In terms of chemical potentials, Eq. (4.4.6) can be written for one mole of
AB compound as

$$N_0\,[\mu_A(0.5) + \mu_B(0.5) - \mu_{AB}] = N_0\,\Delta\mu_{AB} = \Delta S_m\,(T_m - T) - \Delta C\,[T_m - T - T\ln(T_m/T)], \tag{4.4.7}$$
where $\Delta C = C_\ell(T) - C_s(T)$ is assumed to be independent of $T$,
and the entropy of fusion is defined as
$\Delta S_m = S_\ell(T_m) - S_s(T_m)$ for one mole of AB compound. Using
Eq. (4.4.1) for $\mu_{AB}$ and the relation between the chemical potentials
and activity coefficients in Eq. (4.3.9), Eq. (4.4.7) can now be written as

$$\ln\left[\frac{1}{4x_0(1-x_0)}\,\frac{\gamma_A(0.5)\,\gamma_B(0.5)}{\gamma_A(x_0)\,\gamma_B(x_0)}\right] = \frac{\Delta S_m}{RT}\,(T_m - T) - \frac{\Delta C}{RT}\left[T_m - T - T\ln\left(\frac{T_m}{T}\right)\right]. \tag{4.4.8}$$

This is Vieland's (1963) formula. It is a simple transcendental equation
for $T$ as a function of $x_0$, i.e., the liquidus curve. When the liquid
solution is in equilibrium with the pure solid B phase, the liquidus is
governed by a similar equation

$$\ln\left[\frac{1}{x_0\,\gamma_B(x_0)}\right] = \frac{\Delta S_m^B}{RT}\,(T_m^B - T) - \frac{\Delta C^B}{RT}\left[T_m^B - T - T\ln\left(\frac{T_m^B}{T}\right)\right]. \tag{4.4.9}$$

To actually calculate the liquidus curve using Eqs. (4.4.8) and (4.4.9),
one needs to know the activity coefficients $\gamma_A$ and $\gamma_B$,
which of course depend on the statistical model and the energetics. If QCA
is used, these activity coefficients are given explicitly by Eqs. (4.3.12)
and (4.3.13). Then the remaining quantities needed are the mixing energy
parameter $\Omega = N_0 z\varepsilon$ for the binary liquid, the fusion
entropy $\Delta S_m$, and the heat capacity difference $\Delta C$. In most
applications $\Delta C$ has been taken to be zero, which is a good
approximation at high temperature. This approach works reasonably well
for the III-V binary compounds. Examples are shown in Fig. 4.4.2, where
calculated liquidus curves (Stringfellow and Green, 1969; Kikuchi, 1981)
for GaAs, InAs and InSb are compared with experiment. The parameters
used in the calculations are listed in Table 4.4.1. Note that the entropy
unit eu used for $\Delta S_m$ is eu = J/(K-mole). The symmetrical liquidus
curves indicate that the QCA model, with coordination number $z = 4$, is a
reasonable representation of these binary melts. This approach, however,
does not work well for the II-VI systems. As shown in Fig. 4.4.3, the
skewed and sharp nature of the experimental curves indicates that these
liquid solutions are more complex than the simple picture represented by
QCA for the binary solutions. A simple extension of QCA to include the
molecule species AB in addition to A and B in the liquid solutions is found
to work well, as indicated by the comparison with experiment in Fig. 4.4.3.
This extended QCA model is called the associated solution model. Kikuchi
(1982) found that repulsive interactions between the molecular and
atomic species in the solution are responsible for the skewed nature of
the liquidus curves.
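To see how Vieland's formula produces a liquidus curve, the sketch below solves Eq. (4.4.8) in the simplest setting: ΔC = 0 and strict-regular (zeroth approximation) activity coefficients instead of the QCA ones used in the cited calculations. Under those assumptions the logarithm of the activity-coefficient ratio is -Ω(1 - 2x0)²/(2RT), and Eq. (4.4.8) can be solved for T in closed form. The function name and the numerical inputs are illustrative and are not the Table 4.4.1 parameters; ΔS_m, Ω and R must be supplied in mutually consistent units.

```python
import math


def liquidus_temperature(x0, t_melt, ds_fusion, omega, gas_constant):
    """Liquidus T(x0) from Vieland's formula with ΔC = 0.

    Assumes strict-regular activity coefficients, which gives
        T = [ΔS_m T_m + Ω (1 - 2 x0)^2 / 2] / [ΔS_m - R ln(4 x0 (1 - x0))].
    """
    if not 0.0 < x0 < 1.0:
        raise ValueError("x0 must lie strictly between 0 and 1")
    numerator = ds_fusion * t_melt + 0.5 * omega * (1.0 - 2.0 * x0) ** 2
    denominator = ds_fusion - gas_constant * math.log(4.0 * x0 * (1.0 - x0))
    return numerator / denominator


# Hypothetical inputs in calorie-based units (R = 1.987 cal/(K mole)).
T_M, DS_M, OMEGA, R = 1500.0, 16.0, -4000.0, 1.987
for x0 in (0.1, 0.3, 0.5, 0.7, 0.9):
    print(x0, liquidus_temperature(x0, T_M, DS_M, OMEGA, R))
```

The curve peaks at the melting temperature for x0 = 1/2 and is symmetric about that point, which is the shape the text associates with the III-V binaries.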


Table 4.4.1
Parameters used in the calculation of liquidus curves in Fig. 4.4.2 and
phase diagrams in Fig. 4.5.4.

Melting temperature      Entropy of fusion       Mixing enthalpy parameter
$T_m$ (K)                $\Delta S_m$ (eu)       $\Omega$ (cal/mole)

GaAs    1511             GaAs    14.7            Ga-As        -4380
InAs    1215             InAs    14.7            In-As        -6070
InSb     803             InSb    13.3            In-Sb        -3980
As      1090             As       3.67           In-Ga         1066
Sb       903             Sb       5.27           As-Sb          610
                                                 GaAs-InAs     2800
                                                 InAs-InSb     2900

After Stringfellow and Green (1969)
4.5 Ternary Phase Diagram

The methods of Sec. 4.4 are next extended to the phase diagrams for
ternary systems. The phase boundary curves are the solidus-liquidus
curves that describe the equilibrium between the pseudobinary solid
solutions $A_{1-x}B_xC$ and the liquid ternary solution denoted by
$A_aB_bC_c$, where $a$, $b$, and $c$ are fractional concentrations
satisfying the normalization $a + b + c = 1$. A schematic
three-dimensional phase diagram is shown in Fig. 4.5.1; it is a plot of the
temperature $T$ versus concentrations. The liquidus is the upper surface
and the solid pseudobinary phase is the plane lying between the two lines
labeled AC and BC. A given set of $a$, $b$, and $c$ values corresponds to a
point D inside the base triangle ABC, as shown in Fig. 4.5.2. The values of
$a$, $b$, and $c$ are related to the geometric line ratios by
$a = DA'/AA'$, $b = DB'/BB'$ and $c = DC'/CC'$. The solidus-liquidus
boundaries form a family of curves relating the solid concentration $x$ and
the temperature $T$ to the two independent liquid concentrations $a$ and
$b$ (note $c = 1 - a - b$). These relations can be obtained by matching the
chemical potentials $\mu$, as described below.

Let us start with $\mu_{AC}$ and $\mu_{BC}$ for the AC and BC components
in the solid solution $A_{1-x}B_xC$. They are related to those in the pure
AC and BC compounds through the activity coefficients $\gamma$:

$$\mu_{AC}(x,T) = \mu^{0}_{AC}(T) + kT\ln[(1-x)\gamma_{AC}] \tag{4.5.1}$$

and

$$\mu_{BC}(x,T) = \mu^{0}_{BC}(T) + kT\ln(x\gamma_{BC}), \tag{4.5.2}$$

where the superscript 0 designates pure compounds. Now Vieland's
relation in Eq. (4.4.7) can be used to relate the chemical potentials
$\mu^{0}$ of the pure solid compounds to those of the super-cooled 50-50
binary liquid mixtures, denoted $\mu^{sc}$. For example, $\mu^{0}_{AC}$ is
given by

$$\mu^{0}_{AC} = \mu^{sc}_{A} + \mu^{sc}_{C} - \Delta\mu_{AC}. \tag{4.5.3}$$
Figure 4.5.2 Any point D inside the triangle ABC defines a unique set
of fractional concentrations in the ternary alloy $A_aB_bC_c$ by
a = DA'/AA', b = DB'/BB', and c = DC'/CC'.
Note that $\Delta\mu_{AC}$ is expressed in terms of a change in the heat
capacity $\Delta C$ and entropy of fusion $\Delta S_m$ between the 50-50
binary liquid and the solid compound AC, given by the right side of
Eq. (4.4.7). Since the ternary liquid is in equilibrium with the solid
pseudobinary, $\mu_{AC}$ and $\mu_{BC}$ in the latter are equal to the sum
of the chemical potentials for the atomic species in the former:

$$\mu_{AC}(x,T) = \mu^{l}_{A}(a,b;T) + \mu^{l}_{C}(a,b;T). \tag{4.5.4}$$

A similar expression holds for $\mu_{BC}$. The $\mu^{l}$ for the atomic
species in the ternary liquid can be further expressed in terms of those
of the pure elementary liquids and activity coefficients by

$$\mu^{l}_{A} = \mu^{0l}_{A} + kT\ln(a\gamma_{A}). \tag{4.5.5}$$


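Combining the above three equations with Eq. (4.5.1), one obtains the
equilibrium condition

$$\ln[(1-x)\gamma_{AC}] = \ln\!\left[\frac{4ac\,\gamma_A\gamma_C}{\gamma^{sc}_A\gamma^{sc}_C}\right] + \frac{\Delta\mu_{AC}}{kT}. \tag{4.5.6}$$

Similarly, Eq. (4.5.2) leads to

$$\ln(x\gamma_{BC}) = \ln\!\left[\frac{4bc\,\gamma_B\gamma_C}{\gamma^{sc}_B\gamma^{sc}_C}\right] + \frac{\Delta\mu_{BC}}{kT}. \tag{4.5.7}$$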


To calculate the phase diagram from Eqs. (4.5.6) and (4.5.7), one needs to
know the x and T dependence of the activity coefficients $\gamma_{AC}$ and
$\gamma_{BC}$ of the pseudobinary solid solutions, the T dependence of the
activity coefficients $\gamma^{sc}$ of the super-cooled 50-50 binary
liquids, and the T and concentration (a, b, and c) dependences of
$\gamma_A$, $\gamma_B$, and $\gamma_C$ in the ternary liquid solutions
$A_aB_bC_c$. In addition, $\Delta C$ and $\Delta S_m$ are needed for the
calculation of the $\Delta\mu$. If QCA is used, then the activity
coefficients for the binary liquid and pseudobinary solid solutions take
the forms of Eqs. (4.3.12) and (4.3.13). For the ternary liquid solution,
QCA has to be extended to three components.

There are now six different kinds of pairs to be considered, namely, AA,
BB, CC, AB, BC, and AC. We label these pairs by j, where j runs from 1 to
6, and denote the fraction of each pair as $x_j$. If independent pair
energies $\varepsilon_{\alpha\beta}$ are assumed, there are only three
nonvanishing energy parameters, $\Delta_{AB}$, $\Delta_{AC}$, and
$\Delta_{BC}$, while $\Delta_{AA} = \Delta_{BB} = \Delta_{CC} = 0$, where
we have defined $\Delta_{\alpha\beta} = \varepsilon_{\alpha\beta} -
(\varepsilon_{\alpha\alpha} + \varepsilon_{\beta\beta})/2$. Then it is
straightforward to write the mixing energy as

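$$\Delta E = \sum_{j=1}^{6} M x_j \Delta_j , \tag{4.5.8}$$

where M = Nz/2 with z being an effective coordination number, taken to be
4. The mixing entropy is $\Delta S = k\ln O$, with $O$ given by

$$O = \frac{N!}{N_A!\,N_B!\,N_C!}\;\frac{M!}{\prod_j M_j!}\;\prod_j \left(x^{0}_j\right)^{M_j}, \tag{4.5.9}$$

where $M_j = x_j M$ and $x^{0}_j$ are the a priori probabilities, e.g.,
$x^{0}_{AA} = a^2$ and $x^{0}_{AB} = 2ab$, etc. This $O$ is a simple
extension of Eq. (4.2.9). The mixing free energy then reads

$$\Delta F = M\sum_j \left[x_j\Delta_j + kT\,x_j\ln\!\left(x_j/x^{0}_j\right)\right] + NkT\,(a\ln a + b\ln b + c\ln c). \tag{4.5.10}$$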

Note that not all the $x_j$ are independent. They are constrained to have
the right numbers of A, B, and C atoms for a given set of fractional
concentrations a, b, and c:

$$2x_{AA} + x_{AB} + x_{AC} = 2a, \tag{4.5.11}$$

$$2x_{BB} + x_{AB} + x_{BC} = 2b, \tag{4.5.12}$$

$$2x_{CC} + x_{BC} + x_{AC} = 2c. \tag{4.5.13}$$

Minimization of $\Delta F$ with respect to the $x_j$ under the imposed
constraints leads to the following three coupled equations:

$$(2a - x_{AB} - x_{AC})(2b - x_{AB} - x_{BC}) = x_{AB}^2 \exp[\Omega_{AB}/(2RT)], \tag{4.5.14}$$

$$(2a - x_{AB} - x_{AC})(2c - x_{AC} - x_{BC}) = x_{AC}^2 \exp[\Omega_{AC}/(2RT)], \tag{4.5.15}$$

$$(2b - x_{AB} - x_{BC})(2c - x_{AC} - x_{BC}) = x_{BC}^2 \exp[\Omega_{BC}/(2RT)], \tag{4.5.16}$$

where the mixing energy parameters are defined as
$\Omega_{\alpha\beta} = N_0 z \Delta_{\alpha\beta}$, with z taken to be
four. After Eqs. (4.5.11) through (4.5.16) are solved for $x_j$, they are
used in Eq. (4.5.10) to obtain the functional dependence of $\Delta F$ on
the concentrations a, b, and c and on the temperature T. Then the activity
coefficients can be calculated. For example, $\gamma_A$ is calculated from

$$kT\ln(a\gamma_A) = \frac{\partial\Delta F}{\partial N_A} = \frac{\partial\Delta F}{\partial a}\frac{\partial a}{\partial N_A} + \frac{\partial\Delta F}{\partial b}\frac{\partial b}{\partial N_A}. \tag{4.5.17}$$

In this equation, a and b have been chosen as independent variables with
c = 1 - a - b.
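Equations (4.5.14) through (4.5.16) have no closed-form solution in
general. The following is a minimal Python sketch, not the procedure used
in the cited work, of one way to solve them numerically: a damped Newton
iteration started from the random-alloy values 2ab, 2ac, and 2bc. The
function names, the value R = 1.987 cal/(mol K), and the sample inputs in
the usage note are illustrative assumptions.

```python
import math

R_CAL = 1.987  # gas constant in cal/(mol K); assumed unit convention


def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def solve3(m, rhs):
    """Solve the 3x3 linear system m * s = rhs by Cramer's rule."""
    d = det3(m)
    out = []
    for k in range(3):
        mk = [row[:] for row in m]
        for i in range(3):
            mk[i][k] = rhs[i]
        out.append(det3(mk) / d)
    return out


def solve_pair_fractions(a, b, omega_ab, omega_ac, omega_bc, T,
                         tol=1e-12, max_iter=200):
    """Return (x_AB, x_AC, x_BC) satisfying Eqs. (4.5.14)-(4.5.16)."""
    c = 1.0 - a - b
    e_ab = math.exp(omega_ab / (2.0 * R_CAL * T))
    e_ac = math.exp(omega_ac / (2.0 * R_CAL * T))
    e_bc = math.exp(omega_bc / (2.0 * R_CAL * T))

    def residual(u, v, w):
        # p, q, r are 2x_AA, 2x_BB, 2x_CC from Eqs. (4.5.11)-(4.5.13)
        p, q, r = 2 * a - u - v, 2 * b - u - w, 2 * c - v - w
        return [p * q - e_ab * u * u, p * r - e_ac * v * v, q * r - e_bc * w * w]

    def norm(f):
        return max(abs(t) for t in f)

    u, v, w = 2 * a * b, 2 * a * c, 2 * b * c  # random-alloy starting guess
    for _ in range(max_iter):
        f = residual(u, v, w)
        if norm(f) < tol:
            break
        p, q, r = 2 * a - u - v, 2 * b - u - w, 2 * c - v - w
        jac = [[-p - q - 2 * e_ab * u, -q, -p],
               [-r, -p - r - 2 * e_ac * v, -p],
               [-r, -q, -q - r - 2 * e_bc * w]]
        du, dv, dw = solve3(jac, [-t for t in f])
        step = 1.0
        while step > 1e-8:
            un, vn, wn = u + step * du, v + step * dv, w + step * dw
            feasible = min(un, vn, wn, 2 * a - un - vn,
                           2 * b - un - wn, 2 * c - vn - wn) > 0
            if feasible and norm(residual(un, vn, wn)) < norm(f):
                break
            step *= 0.5  # halve the step until all fractions stay positive
        u, v, w = un, vn, wn
    return u, v, w
```

For example, `solve_pair_fractions(0.3, 0.2, 1066.0, -6070.0, -4380.0, 1200.0)`
treats A, B, C as In, Ga, As with the liquid parameters of Table 4.4.1 at an
assumed temperature of 1200 K. With all three parameters set to zero the
routine returns the random-alloy values unchanged.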

Thus the parameters needed to complete the specification of a ternary
phase diagram in QCA are $\Delta C$ and $\Delta S_m$ for both AC and BC
compounds, the mixing energy parameters $\Omega_{AB}$, $\Omega_{AC}$, and
$\Omega_{BC}$ for the ternary liquid, and $\Omega_{AB(C)}$ for the
pseudobinary solid solution. It is a good approximation to set
$\Delta C = 0$. There are four independent variables, say, a, b, x, and T,
and two equations, Eqs. (4.5.6) and (4.5.7), with which to work. One can
fix T and solve for a (and hence c = 1 - a - b) and x as functions of b.
The resulting contour of a, b, and c at fixed T in the Gibbs triangle ABC
is an isotherm. If instead the solidus concentration x is fixed, the
contour of a, b, and c in the Gibbs triangle is the
iso-solidus-concentration line. Figure 4.5.3 shows both sets of these
plots for the In-Ga-As system reported by Stringfellow and Green (1969)
using the parameters given in Table 4.4.1. If we fix c = 0.5 in the
ternary and solve for x and T as functions of a, we obtain the liquidus
and the solidus curves along the c = 1/2 line, as shown in Fig. 4.5.4,
also by Stringfellow and Green and further verified by Kikuchi (1981). The
agreement between the theory and experiment is remarkably good.
Figure 4.5.3 Isotherms and iso-solidus concentration lines for phase
equilibrium between pseudobinaries $In_{1-x}Ga_xAs$ and the corresponding
ternary liquid. After Stringfellow and Green (1969).

Figure 4.5.4 Liquidus and solidus curves for $In_{1-x}Ga_xAs$ and
$InSb_{1-x}As_x$. Solid curves are calculations and the circles are the
experiments. After Kikuchi (1981).


Before  moving  into  more  rigorous  statistical  theory  and  more  systematic 
energetic  calculations,  it  is  useful  to  summarize  the  semiempirical  phase 
diagram  data  and  simple  statistical  and  energetic  models  that  have  been 
used  in  practical  calculations. 

From a survey of phase diagram calculations of ternary and quaternary
semiconductors, Stringfellow (1974) preferred the following forms for
mixing free energies. For the III-V liquid solutions, the preferred form
(per mole) is

$$\Delta F = x(1-x)(\Omega_1 - \Omega_2 T) + RT(x\ln x + y\ln y). \tag{4.6.1}$$

For the III-III and V-V liquids and pseudobinary solids, the strict regular
solution model is his preference,

$$\Delta F = x(1-x)\Omega + RT(x\ln x + y\ln y). \tag{4.6.2}$$

Table 4.6.1 lists the values of $\Omega$ and other phase diagram data,
including the entropy of fusion $\Delta S_m$, the melting temperatures
$T_m$ for the III-V binary systems, and the mixing energy parameters for
the III-III and V-V interactions in the liquid solutions. In Table 4.6.2,
the experimental values of $\Omega$ based on Eq. (4.6.2) are compared with
the mixing enthalpies calculated from several theoretical models to be
discussed presently. One has to be cautious in comparing the experimental
$\Omega$ with the calculated values, because the former may include
additional contributions from entropy terms not contained in the second
term of Eq. (4.6.2).
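The strict regular solution form of Eq. (4.6.2) is simple enough to
evaluate directly. The sketch below is an illustration only: the function
names and the value R = 1.987 cal/(mol K) are assumptions. It uses the
standard result that the curvature of Eq. (4.6.2),
$\partial^2\Delta F/\partial x^2 = -2\Omega + RT/[x(1-x)]$, vanishes at
x = 1/2 when $T = \Omega/(2R)$.

```python
import math

R_CAL = 1.987  # gas constant in cal/(mol K); assumed unit convention


def regular_solution_free_energy(x, omega, T):
    """Mixing free energy per mole from Eq. (4.6.2); omega in cal/mole, T in K."""
    y = 1.0 - x
    return x * y * omega + R_CAL * T * (x * math.log(x) + y * math.log(y))


def critical_temperature(omega):
    """Temperature at which the curvature of Eq. (4.6.2) vanishes at x = 1/2."""
    return omega / (2.0 * R_CAL)


def spinodal_compositions(omega, T):
    """Roots of -2*omega + R*T/(x*(1-x)) = 0, or None if there is no spinodal."""
    if omega <= 0:
        return None
    s = 1.0 - 2.0 * R_CAL * T / omega
    if s < 0:
        return None
    root = math.sqrt(s)
    return 0.5 * (1.0 - root), 0.5 * (1.0 + root)
```

As a usage example, `critical_temperature(2800.0)` evaluates the model for
the GaAs-InAs solid parameter of Table 4.4.1, giving a temperature of
roughly 700 K.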

One simple model for estimating the mixing energy is the so-called delta
lattice parameter (DLP) model suggested by Stringfellow (1974). He
observed that the experimental $\Omega$ values for the III-V pseudobinary
alloys $A_{1-x}B_xC$ correlated strongly with the difference in the lattice
constants, $\Delta a = a_{BC} - a_{AC}$, as shown in Fig. 4.6.1 in a
log-log plot generated by
Table 4.6.1

The values of $\Omega_1 - \Omega_2 T$ in Eq. (4.6.1) for the liquid mixing
enthalpy parameter in cal/mole and other parameters in the liquid phase.

  System    Ω (cal/mole)      ΔSm (eu)    Tm (°C)
  ------------------------------------------------
  AlP       2800 - 4.80 T     15.0        2530
  AlAs      600 - 12 T        15.6        1770
  AlSb      1230 - 10 T       14.74       1065
  GaP       28000 - 4.8 T     16.8        1465
  GaAs      5160 - 0.16 T     16.64       1238
  GaSb      4700 - 6 T        15.8         710
  InP       4500 - 4 T        14.0        1070
  InAs      3860 - 10 T       14.52        942
  InSb      3400 - 12 T       14.32        525
  Al-Ga     104
  Al-In     1060
  Ga-In     1060
  P-As      1500
  As-Sb     750

After Stringfellow (1974)




Table  4.6.2 

Mixing energy parameters $\Omega$ in kcal/mole. For models in which the
mixing enthalpy $\Delta H$ is a function of both x and T, $\Omega$ is
defined by $\Delta H = x(1-x)\Omega$ for a random alloy ($T \to \infty$)
at x = 0.5.


system 

DLP“ 

FM^ 

IMP“ 

MZ" 

CS* 

gqca' 

WF* 

Exp 

(Ga,Al)P 

0.00 

-0.05 

0.01 

(Ga,Al)As 

0.02 

0.03 

0.00 

0.02 

-0.07 

0.03 

0.30 

o.o’’ 

(Ga,Al)Sb 

0.02 

0.03 

0.02 

0.02 

-0.15 

0.12 

0.0** 

(Ga,In)P 

3.63 

2.94 

3.00 

4.56 

2.54 

2.85 

3.07 

3.25) 

3.40* 
2 .  OO) 

3.50“ 

(Ga,In)As 

2.81 

2.42 

2.36 

2.49 

1.60 

2.19 

2.35 

1.65) 

3.00“ 

(Ga,In)Sb 

1.85 

1.83 

1.77 

2.53 

0.81 

1.72 

1.48) 

1.90** 

(In,Al)P 

2.77 

2.55 

2.66 

(In,Al)As 

2.81 

2.37 

2.32 

3.60 

2.17 

2.26 

2.50“ 

(In,Al)Sb 

1.46 

1.45 

1.49 

2.06 

1.36 

1.46 

0.60“ 

(Cd,Zn)Te 

1.97 

1.63 

1.73 

2.12 

1.24 

1.59 

2.29 

1.34“ 

(Hg,Cd)Te 

0.00 

-0.07 

0.03 

0.38 

0.72" 

1.40“ 

(Hg,Zn)Te 

1.81 

1.48 

1.56 

1.91 

1.50 

1.55 

1.88 

3.00“ 

Al(P,As) 

0.65 

0.76 

0.67 

Ga(P,As) 

0.98 

0.66 

0.70 

1.15 

0.94 

0.74 

0.91 

0.40j 

i.oo) 

1.26** 

In(P,As) 

0.58 

0.52 

0.52 

0.72 

0.57 

0.52 

0.40** 

Al(As,Sb) 

3.38 

4.09 

3.45 

Ga(As , Sb) 

3.35 

2.76 

2.81 

4.38 

3.67 

2.92 

4.00‘ 

4.27^ 

2.90'’ 

4.50“ 

In(As,Sb) 

2.29 

2.17 

2.23 

2.89 

2.52 

2.20 

2.25^ 

Al(P,Sb) 

6.99 

8.32 

6.94 

Ga(P,Sb) 

6.36 

8.66 

6.57 

In(P,Sb) 

5.08 

5.76 

4.97 

Zn(S,Se) 

0.98 

0.90 

0.95 

Zn(Se ,Te) 

3.11 

2.12 

2.23 

2.91 

2.26 

2.16 

1.55 

Zn(S,Te) 

6.45 

6.20 

6.12 

a: Stringfellow (1974), $\Omega$ in Eq. (4.6.5).
b: Fedders and Muller (1984), $\Omega$ in Eq. (4.6.7).
c: Chen and Sher (1985), $\Omega$ in Eq. (4.6.13).
d: Martins and Zunger (1984), VFF model.
e: Chen and Sher (1985), VFF plus BOM perturbation.
f: Sher et al. (1987), the 16-bond model in Sec. 4.9.
g: Wei et al. (1990), LDA calculation described in Sec. 4.11.
h: Panish and Ilegems (1972).
i: Foster (1972).
j: Antypas (1970b).
k: Foster and Woods (1971).
l: Antypas (1970a).
m: Laugier (1973).
n: Su et al. (1985), value taken at 1000 K.
p: Stringfellow (1974, 1982).
Figure 4.6.1 Log-log plot of experimental values of $\Omega$ versus
percentage lattice constant difference. The solid line represents the
best straight-line fit. After Stringfellow (1974).
Stringfellow (1974). The best straight-line fit gives a slope of 2.45,
yielding the following fitted function:

flfit  =  1.174  X  10®  (Aa/a)2-^5  ,  (4.6.3) 

where $a = (a_{AC} + a_{BC})/2$. This result prompted Stringfellow to
correlate $\Omega$ with Phillips and Van Vechten's (1970) dielectric
model, which relates the alloy formation energy to the energy-gap
parameter $E_h$ [see Eq. (2.6.4)], which in turn is roughly proportional
to $a^{-2.5}$. By simply assuming that the formation energy has the form

$$E = -K/a^{2.5} \tag{4.6.4}$$

for every semiconductor, the mixing enthalpy is given by

$$\begin{aligned}
\Delta H &= E(\mathrm{alloy}) - xE(BC) - (1-x)E(AC) \\
 &= -K\left[a_{\mathrm{alloy}}^{-2.5} - x\,a_{BC}^{-2.5} - (1-x)\,a_{AC}^{-2.5}\right] \\
 &= 4.375\,K\,x(1-x)\,(\Delta a)^2/a^{4.5} = x(1-x)\,\Omega_{DLP}.
\end{aligned} \tag{4.6.5}$$

In Eq. (4.6.5) the alloy lattice constant $a_{\mathrm{alloy}}$ is assumed
to be the concentration-weighted average (Vegard's law), and only terms to
second order in $\Delta a$ in the Taylor expansion have been kept. If K is
treated as an adjustable parameter, the best fit to the III-V alloy data
yields $K = 1.15 \times 10^{4}$ for $\Omega$ in kcal/mol and $\Delta a$
and a in Å. This model gives very good results, as indicated in
Table 4.6.2.
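As a quick numerical illustration of Eq. (4.6.5), the sketch below
evaluates $\Omega_{DLP}$ both from the second-order Taylor form and
directly from Eq. (4.6.4) at x = 1/2. It is not taken from the cited work.
The function names are hypothetical, the default K = 1.15e4 (kcal/mol with
lengths in Å) is an assumed magnitude chosen so that the results are of
the order of the DLP column in Table 4.6.2, and the lattice constants in
the usage note are assumed handbook values.

```python
def omega_dlp(a_ac, a_bc, K=1.15e4):
    """Second-order Taylor form of Eq. (4.6.5): 4.375 K (da)^2 / a^4.5."""
    a_mean = 0.5 * (a_ac + a_bc)
    delta_a = a_bc - a_ac
    return 4.375 * K * delta_a ** 2 / a_mean ** 4.5


def omega_dlp_exact(a_ac, a_bc, K=1.15e4):
    """Omega from Eq. (4.6.4) without the Taylor expansion.

    At x = 1/2 the definition dH = x(1-x)*Omega gives Omega = 4*dH, with the
    alloy lattice constant taken as the Vegard's-law average.
    """
    a_alloy = 0.5 * (a_ac + a_bc)
    dH = -K * (a_alloy ** -2.5 - 0.5 * a_bc ** -2.5 - 0.5 * a_ac ** -2.5)
    return 4.0 * dH
```

For instance, `omega_dlp(5.653, 6.058)` uses approximate lattice constants
of GaAs and InAs in Å. The two functions agree to within about one percent
for mismatches of this size, which shows how little is lost by truncating
the expansion at second order.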

Arguing from a different point of view, Fedders and Muller (1984)
suggested that the $(\Delta a)^2$ dependence in Eq. (4.6.5) might
originate from strain energy. If the alloying atoms A and B are held
rigidly on their fcc sublattice sites, and if the alloy lattice constant is
taken to be the concentration-weighted average (Vegard's law), then the
strain energy in the alloy can be shown to be
$\Delta E = x(1-x)\Omega_{FM}$ with a mixing energy parameter given by

$$\Omega_{FM} = \tfrac{9}{2}\,BV\,(\Delta a/a)^2 , \tag{4.6.6}$$

where B is the average bulk modulus and V the average molar volume.
Because B in semiconductors varies as $a^{-n}$ with n ranging from 5 to 9
(see Chapter 3), $\Omega_{FM}$ is seen to behave like
$(\Delta a)^2/a^{n-1}$. Thus the power-law dependences on $\Delta a$ and a
in $\Omega_{DLP}$ and $\Omega_{FM}$ are similar. However, $\Omega_{FM}$ is
about four times larger than the experimental values. In fact a
scaled-down strain contribution,

$$\Omega_{scale} = 0.226\,\Omega_{FM} , \tag{4.6.7}$$

yields results comparable to those of $\Omega_{DLP}$. The reduction of
$\Omega_{FM}$ is mainly due to bond length relaxation neglected in the
above derivation.

If strain energy is the main contribution to the mixing enthalpy, then the
valence force field (VFF) model given in Eq. (3.3.1) should provide a means
to formulate a systematic calculation. One can replace an A atom in the
AC crystal by an impurity B atom and calculate the substitution energy
$\Delta E$(B in AC), and vice versa for $\Delta E$(A in BC); the mixing
enthalpy parameter is then given by (Chen and Sher, 1985)

$$\Omega_{imp} = [\Delta E(A\ \mathrm{in}\ BC) + \Delta E(B\ \mathrm{in}\ AC)]/2 . \tag{4.6.8}$$

To calculate these energies precisely in VFF, one has to allow the atoms
in the first several shells about the impurity to relax. Such
calculations (Martins and Zunger, 1984; Chen and Sher, 1985) do give good
results for the mixing energy and for the bond lengths between the impurity
and the first-shell host atoms. However, it was found (Chen and Sher, 1985)
that by neglecting the bond-angle terms in the VFF ($\beta = 0$) and at the
same time truncating the range of the atomic relaxation at the second
shell, a simple spring model similar to that shown in Figure 3.7.1 yields
nearly the same results as the extended VFF for both the impurity bond
lengths and alloy mixing energy. It is instructive to examine this simple
spring model closely. Let the sixteen bonds before the substitution be the
host BC bonds with VFF force constant $\alpha$ and bond length d. When the
central B atom is removed and an A atom is substituted in its place, the
bond length $d_1$ of the first shell and $d_2$ of the second shell are
different from d. The strain energy in VFF with $\beta = 0$, assuming the
second-shell atoms are held firmly in place, is then given by

$$\Delta E(A\ \mathrm{in}\ BC) = 4\left[\tfrac{3}{2}\alpha_i (d_1 - d_i)^2\right] + 12\left[\tfrac{3}{2}\alpha (d_2 - d)^2\right], \tag{4.6.9}$$

where the subscript i denotes the impurity bond in its own crystal (i.e.,
the AC bond in the pure AC crystal). Let $\delta = (d_1 - d)/d$ and
$\delta_0 = (d_i - d)/d$; then in the relaxed configuration
$(d_2 - d)/d = -\delta/3$ and Eq. (4.6.9) becomes

$$\Delta E(A\ \mathrm{in}\ BC) = 4\left[\tfrac{3}{2}\alpha_i(\delta - \delta_0)^2 + \tfrac{1}{2}\alpha\delta^2\right] d^2 . \tag{4.6.10}$$

Minimizing $\Delta E$ with respect to $\delta$ yields the following value
of $\delta$ at the equilibrium position:

$$\delta = \delta_0 \Big/ \left(1 + \frac{\alpha}{3\alpha_i}\right). \tag{4.6.11}$$

If $\alpha$ and $\alpha_i$ are nearly equal, then

$$\delta = \tfrac{3}{4}\,\delta_0 \tag{4.6.12}$$

and $\Delta E$(A in BC) $= 3\alpha\delta_0^2 d^2/2$. Similar results can be
obtained for $\Delta E$(B in an AC host). Equation (4.6.9) then yields the
following estimate for the mixing energy parameter from this impurity
model,

$$\Omega_{imp} = 3\alpha(\Delta d)^2/2 . \tag{4.6.13}$$

Note that in this model $\Omega_{imp}$ is precisely one quarter of
$\Omega_{FM}$. Thus this simple relaxed lattice model not only explains why
the strain energy is reduced to 1/4 but also gives a very good prediction
of the value 0.75 for the lattice relaxation parameter [compare
Eq. (4.6.12) with the definition in Eq. (1.3.1) and Table 1.3.1] found in
pseudobinary alloys from the EXAFS experiments. While this short-range
strain model with $\beta$ set to zero produces accurate results, keep in
mind that it is not the way nature actually behaves. The approximation
works only because effects from long-range relaxation nearly cancel the
lowest-order modifications caused by the $\beta$ terms. If one asks
questions that involve shears, or asks for more detailed structural
information, e.g., the second-neighbor distances, then effects due to
$\beta$ and long-range relaxations must be included.
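The two headline numbers of the spring model, the relaxation of 3/4 and
the energy reduction to one quarter, can be checked with a few lines of
Python. This is only an illustrative sketch: the function names are
hypothetical, and each bond is given a generic harmonic energy
k*(stretch)^2 in units of d^2, so the overall prefactor of the VFF
bond-stretching term (which does not affect either ratio) is absorbed
into k.

```python
def relaxed_delta(delta0, k_first=1.0, k_host=1.0):
    """Equilibrium first-shell relaxation, the analogue of Eq. (4.6.11)."""
    return delta0 / (1.0 + k_host / (3.0 * k_first))


def strain_energy(delta, delta0, k_first=1.0, k_host=1.0):
    """Energy of 4 first-shell and 12 second-shell bonds, in units of d^2.

    First-shell bonds are stretched by (delta - delta0) relative to their
    natural length; second-shell bonds are stretched by -delta/3.
    """
    return 4.0 * k_first * (delta - delta0) ** 2 + 12.0 * k_host * (delta / 3.0) ** 2


def relaxation_energy_ratio(delta0):
    """Relaxed energy divided by the unrelaxed (delta = 0) energy, equal springs."""
    return strain_energy(relaxed_delta(delta0), delta0) / strain_energy(0.0, delta0)
```

With equal spring constants, `relaxed_delta(d0)` returns 0.75*d0 and
`relaxation_energy_ratio(d0)` returns 0.25 for any nonzero d0, matching
Eq. (4.6.12) and the factor of four between the rigid-sublattice and
relaxed estimates.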


4.7 Generalized Quasi-Chemical Theory (GQCA)

There are at least two reasons to extend the theory beyond the first
approximation. First, the treatment of the neglected statistical
correlations should be improved. Second, and perhaps more importantly, the
microscopic energies that govern the statistics may not be described by
simple pair energies. In this section, QCA is generalized to deal with
clusters of any size.

Let each cluster contain n atoms on the sublattice occupied by the A and B
atoms, and let J + 1 be the number of different configurations in which the
A and B atoms can be arranged on a cluster with a distinct energy
$\varepsilon_j$, for j = 0, 1, ..., J. For example, in the case of a
tetrahedral cluster in which $\varepsilon_j$ differs only when the number
of B atoms on the cluster changes, the index j equals the number of B atoms
on the cluster and therefore takes on five values, j = 0, 1, 2, 3, and 4.
However, for a square with two A atoms and two B atoms on the corners, the
energy for two like atoms on the same side may be different from that when
they are on a diagonal. So J may be larger than n. Obviously, the
degeneracies of arrangements with the same $\varepsilon_j$ tend to decrease
if J > n, since the total number of configurations is fixed, i.e., for a
4-site case it is $2^4$. Let M be the total number of clusters in the
alloy and $M_j$ be the number of j-type clusters, so that
$M = \sum_j M_j$, and let $x_j = M_j/M$ be the fraction of j-type
clusters. Let us further define $n^A_j$ ($n^B_j$) to be the number of A (B)
atoms in a j-type cluster, and the notation $n_j$ will stand for $n^B_j$.
Since the concentration x is fixed, the set of fractions $x_j$ are
constrained to satisfy

$$nx = \sum_{j=0}^{J} n_j x_j . \tag{4.7.1}$$

The mixing energy for a specified set $\{M_j\}$ is then given by

$$\Delta E = \sum_{j=0}^{J} M_j\varepsilon_j - M\left[(1-x)\,\varepsilon^{0}_A(n) + x\,\varepsilon^{0}_B(n)\right], \tag{4.7.2}$$

where $\varepsilon^{0}_A(n)$ and $\varepsilon^{0}_B(n)$ are the energies
of an n-atom cluster in the pure AC and pure BC crystals respectively, as
indicated by the superscript 0. We note that the cluster energy
$\varepsilon_0 \equiv \varepsilon_A(n)$, which is the energy in the alloy
for a cluster containing all A atoms, in general differs from
$\varepsilon^{0}_A(n)$ because of the influence of strain and different
chemical bonding. Let $\varepsilon_A = \varepsilon_0 -
\varepsilon^{0}_A(n)$ and similarly $\varepsilon_B = \varepsilon_J -
\varepsilon^{0}_B(n)$. Then Eq. (4.7.2) can be written as

$$\Delta E = M\left[(1-x)\varepsilon_A + x\varepsilon_B\right] + M\sum_j x_j\Delta_j , \tag{4.7.3}$$

where the reduced excess energies $\Delta_j$ are defined as

$$\Delta_j = \varepsilon_j - \frac{n - n_j}{n}\,\varepsilon_0 - \frac{n_j}{n}\,\varepsilon_J , \tag{4.7.4}$$

and use has been made of Eq. (4.7.1). Note that by definition $\Delta_0$
and $\Delta_J$ are zero.


All the temperature dependence arising from the statistical mechanics is,
as we shall see, contained in the $x_j$ factors. Therefore, the first term
of $\Delta E$ in Eq. (4.7.3) does not influence the temperature dependence
arising from averaged statistical quantities for a given x; nearly all the
temperature dependence is in the small second term. The mixing entropy
can be calculated from $k\ln O$ with $O$ given by [an extension of
Eq. (4.2.9)]

$$O = \frac{N!}{N_A!\,N_B!}\;\frac{M!}{\prod_j M_j!}\;\prod_j\left(x^{0}_j\right)^{M_j}, \tag{4.7.5}$$

where $x^{0}_j$ is the a priori probability, i.e., the random-alloy value

$$x^{0}_j = g_j\,(1-x)^{\,n-n_j}\,x^{\,n_j} , \tag{4.7.6}$$

and $g_j$ is the degeneracy of clusters with energy $\varepsilon_j$. The
resulting mixing entropy takes a simple form

$$\Delta S = -Nk\,(x\ln x + y\ln y) - Mk\sum_j x_j\ln\!\left(x_j/x^{0}_j\right). \tag{4.7.7}$$

Eqs. (4.7.2) and (4.7.7) thus combine to give an explicit expression for the
mixing free energy $\Delta F = \Delta E - T\Delta S$ in terms of $x_j$.

To find the equilibrium cluster probability distribution $\{\bar{x}_j\}$,
one takes the partial derivatives $\partial\Delta F/\partial x_j$ and sets
them to zero. Although there are J + 1 unknown $x_j$'s, there are two
independent constraints, one given by Eq. (4.7.1) and the other being that
the total probability is unity,

$$\sum_j x_j = 1 . \tag{4.7.8}$$

This leaves us with J - 1 nonlinear coupled equations to be solved for the
thermally averaged $\{\bar{x}_j\}$. While the set $\{\bar{x}_j\}$ are the
equilibrium values of the more general set $\{x_j\}$, we shall leave the
bar off since it causes no confusion. A simple approach to the numerical
problem is to use the Lagrange multiplier formalism in the constrained
variational calculation for the $\{x_j\}$ set, i.e.,

$$\frac{\partial}{\partial x_j}\left\{\frac{\Delta F}{M} - \gamma\Big(\sum_j x_j - 1\Big) - \alpha\Big(\sum_j n_j x_j - nx\Big)\right\} = 0 . \tag{4.7.9}$$

This relates $x_j$ to the Lagrange multipliers $\gamma$ and $\alpha$:

$$x_j = x^{0}_j \exp[(\gamma + \alpha n_j - \Delta_j)/kT] . \tag{4.7.10}$$

The normalization condition (4.7.8) is then used to eliminate $\gamma$ and
obtain the following result:

$$x_j = \frac{x^{0}_j \exp[(\alpha n_j - \Delta_j)/kT]}{\sum_j x^{0}_j \exp[(\alpha n_j - \Delta_j)/kT]} . \tag{4.7.11}$$


The algebra is simplified by defining a new variable
$\eta \equiv x\,e^{\alpha/kT}/y$. Then

$$x_j = g_j\,\eta^{\,n_j}\,e^{-\beta\Delta_j}/Z , \tag{4.7.12}$$

where $\beta = 1/(kT)$, and Z is a cluster grand partition function

$$Z = \sum_j g_j\,\eta^{\,n_j}\,e^{-\beta\Delta_j} . \tag{4.7.13}$$

In general the thermal average of any dynamic variable D is

$$\bar{D} = \sum_j D_j x_j . \tag{4.7.14}$$

Then the expression for $x_j$ in Eq. (4.7.12) is used in the other
constraint equation (4.7.1) to arrive at the following nth-order
polynomial equation for the unknown parameter $\eta$:

$$\bar{n}_j = \sum_j n_j x_j = \sum_j n_j\,g_j\,\eta^{\,n_j}\,e^{-\beta\Delta_j}/Z = nx . \tag{4.7.15}$$
We shall refer to the above argument as the generalized QCA (GQCA), while
QCA is reserved for the pair interaction case.


To illustrate the usefulness of the equations developed in this section,
let us consider the case in which the cluster energies $\varepsilon_j$
depend only on the number of A and B atoms in the cluster. For a cluster
containing n atoms there are a total of n + 1 distinct energies of the
cluster type $A_{n-j}B_j$, with j ranging from 0 to n and with $n_j = j$.
Then the cluster degeneracy is simply

$$g_j = \frac{n!}{j!\,(n-j)!} . \tag{4.7.16}$$
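For this special case the GQCA reduces to a one-dimensional root-finding
problem. The sketch below is an illustration under stated assumptions, not
the authors' code: it takes $n_j = j$ and the binomial degeneracies of
Eq. (4.7.16), solves Eq. (4.7.15) for $\ln\eta$ by bisection (the left side
increases monotonically with $\eta$), and returns the cluster fractions of
Eq. (4.7.12). The function name and the search bracket for $\ln\eta$ are
hypothetical choices, and `math.comb` requires Python 3.8 or later.

```python
import math


def cluster_fractions(x, deltas, kT):
    """Solve Eq. (4.7.15) for eta and return (eta, [x_0, ..., x_n]).

    x      : B-atom concentration, 0 < x < 1
    deltas : reduced excess energies Delta_j for j = 0..n (same units as kT)
    kT     : thermal energy
    """
    n = len(deltas) - 1
    g = [math.comb(n, j) for j in range(n + 1)]  # Eq. (4.7.16)

    def fractions(log_eta):
        # Work with logarithms so extreme values of eta do not overflow.
        logs = [math.log(g[j]) + j * log_eta - deltas[j] / kT
                for j in range(n + 1)]
        top = max(logs)
        weights = [math.exp(t - top) for t in logs]
        z = sum(weights)
        return [wt / z for wt in weights]

    def mean_b(log_eta):
        # Average number of B atoms per cluster, the left side of Eq. (4.7.15).
        return sum(j * xj for j, xj in enumerate(fractions(log_eta)))

    lo, hi = -50.0, 50.0  # assumed bracket for ln(eta)
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if mean_b(mid) < n * x:
            lo = mid
        else:
            hi = mid
    log_eta = 0.5 * (lo + hi)
    return math.exp(log_eta), fractions(log_eta)
```

When all $\Delta_j$ vanish the routine returns $\eta = x/(1-x)$ and the
binomial random-alloy distribution of Eq. (4.7.6). Positive $\Delta_j$ for
the mixed clusters shift weight toward the pure A and pure B clusters.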


In terms of $\eta$ and Z the mixing free energy is given by

$$\Delta F = N\mathcal{M}\,(y\varepsilon_A + x\varepsilon_B) + NkT\left[(1 - n\mathcal{M})(x\ln x + y\ln y) - \mathcal{M}\ln Z + n\mathcal{M}\,x\ln\eta\right], \tag{4.7.17}$$

where we have defined $\mathcal{M} \equiv M/N$. Notice that a term from
$\Delta S$ has exactly cancelled the temperature-dependent term
$M\sum_j x_j\Delta_j$ in the enthalpy. Thus it is impossible to determine
the temperature variation of $\Delta E$ from a measurement of $\Delta F$.


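To determine the miscibility gap, the spinodal curve, and the critical
temperature we need to examine the derivatives of $\Delta F$ with respect
to x. They are simply given by

$$\frac{\partial\Delta F}{\partial x} = N\mathcal{M}\left[\varepsilon_B - \varepsilon_A + y\frac{\partial\varepsilon_A}{\partial x} + x\frac{\partial\varepsilon_B}{\partial x} + \overline{\left(\frac{\partial\Delta_j}{\partial x}\right)}\right] + NkT\left[(1 - n\mathcal{M})\ln(x/y) + n\mathcal{M}\ln\eta\right], \tag{4.7.18}$$

where $\mathcal{M} = M/N$.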
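Equation (4.7.17) is obtained from Eq. (4.7.16) by taking the partial of Z
given by Eq. (4.7.13) with respect to x, and combining the result with Eqs.
(4.7.12) and (4.7.14) to yield the relation

$$\frac{\partial\ln Z}{\partial x} = nx\,\frac{\partial\ln\eta}{\partial x} - \frac{1}{kT}\,\overline{\left(\frac{\partial\Delta_j}{\partial x}\right)} . \tag{4.7.20}$$

If the $\partial/\partial x$ is also taken of Eq. (4.7.15), that produces
a second equation in $\partial\ln\eta/\partial x$ and
$\partial\ln Z/\partial x$, which can be solved along with Eq. (4.7.20):

$$\frac{\partial\ln\eta}{\partial x} = \frac{n + \dfrac{1}{kT}\left[\overline{\left(n_j\dfrac{\partial\Delta_j}{\partial x}\right)} - nx\,\overline{\left(\dfrac{\partial\Delta_j}{\partial x}\right)}\right]}{\overline{n_j^2} - n^2x^2} , \tag{4.7.21}$$

$$\frac{\partial\ln Z}{\partial x} = \frac{n^2x + \dfrac{1}{kT}\left[nx\,\overline{\left(n_j\dfrac{\partial\Delta_j}{\partial x}\right)} - \overline{n_j^2}\;\overline{\left(\dfrac{\partial\Delta_j}{\partial x}\right)}\right]}{\overline{n_j^2} - n^2x^2} . \tag{4.7.22}$$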

Equations (4.7.18 - 4.7.22) recognize that the excess energies
$\varepsilon_A$, $\varepsilon_B$ and the set $\{\Delta_j\}$ have an
explicit x dependence. This can be seen in Fig. 4.9.2, where this x
dependence for $Ga_{1-x}In_xAs$ is plotted; in this case the variation of
the set $\{\Delta_j\}$ is quite small. However, as we shall demonstrate
later, for lattice-mismatched alloys
$\varepsilon_A \cong \varepsilon_{A0}x^2$ and
$\varepsilon_B \cong \varepsilon_{B0}y^2$ are good approximations, where
$\varepsilon_{A0} \cong \varepsilon_{B0} \gg \Delta_j$ for all j. This
dependence arises because a pure A or pure B cluster has large strain
energy in the alloy, whose average lattice constant follows Vegard's law
(see Fig. 1.5).

Given these x dependences of $\varepsilon_A$ and $\varepsilon_B$, the
leading mixing enthalpy term in Eq. (4.7.17) becomes

$$N\mathcal{M}\,(y\varepsilon_A + x\varepsilon_B) = N\Omega'\,x(1-x) - N\mathcal{M}\,(\varepsilon_{B0} - \varepsilon_{A0})\,x^2(1-x) ,$$

where

$$\Omega' \equiv \mathcal{M}\,\varepsilon_{B0} .$$
Most often one finds $\varepsilon_{B0} - \varepsilon_{A0} \ll
\varepsilon_{B0}$, so the mixing enthalpy has the usual form
$N\Omega'x(1-x)$ that was assumed in Section 4.2, Eq. (4.2.3).

Equations (4.7.18 - 4.7.22) simplify greatly if the
$(1/kT)(\partial\Delta_j/\partial x)$ terms can be neglected. There are
some exceptional cases where $g_j$ may also have a slow x dependence, but
they are so rare that we have not included this possibility in
Eqs. (4.7.17 - 4.7.22).

Note that once $\eta$ is deduced by solving Eq. (4.7.15),
$\partial\eta/\partial x$ can be found from Eq. (4.7.21). Then the
miscibility gap, the spinodal curve, and the critical temperature can be
evaluated.

If $\Delta_j$ contains only pair potential combinations, then $\Delta F$
is a symmetric function of x about x = 1/2 for a given T. The miscibility
critical points $x_1$ and $x_2$, if they exist at a given T, are determined
by the condition $\partial\Delta F/\partial x = 0$ and are related by
$x_1 = 1 - x_2$. Similarly, the values $x'_1$ and $x'_2$ of the spinodal
are determined by the condition $\partial^2\Delta F/\partial x^2 = 0$ and
are also related by $x'_1 = 1 - x'_2$. The critical temperature $T_c$
arises from the special case where $x_1 = x_2 = x'_1 = x'_2 = 1/2$ and
$\partial^2\Delta F/\partial x^2 = 0$. At x = 1/2, the solution for $\eta$
is $\eta = 1$. This result can be obtained by inspecting Eq. (4.7.15) and
knowing that interchanging the A and B atoms does not change the number of
AB pairs in a cluster, so $\Delta_j = \Delta_{n-j}$ and also
$g_j = g_{n-j}$. Using this $\eta$ value, one can express
$\partial\ln\eta/\partial x$ at x = 1/2 explicitly in terms of T.
Eqs. (4.7.19) and (4.7.21) then lead to the following explicit equation for
the critical temperature $T_c$:

$$-\frac{2\varepsilon_{B0}}{kT_c} + \frac{n^2}{\overline{n_j^2} - n^2/4} + \frac{4\,(1 - n\mathcal{M})}{\mathcal{M}} = 0 , \tag{4.7.23}$$

where for simplicity $(\varepsilon_{B0} - \varepsilon_{A0})/kT$ and
$\partial\Delta_j/\partial x$ have been set to zero, and
$\mathcal{M} = M/N$.
For the case in which the clusters are pairs, i.e., QCA, we have n = 2 and
$M/N = z/2$, where z is the coordination number. Then Eq. (4.7.23) reduces
to the following simple equation:

$$\frac{\Delta_1}{kT_c} = -\ln\!\left(\frac{z-2}{z} + \frac{\varepsilon_{B0}}{2kT_c}\right). \tag{4.7.24}$$

Comparing Eqs. (4.7.23) and (4.2.16), we see they are superficially
similar, but in fact are quite different. The reason for the difference is
that the x dependence of the $\{\Delta_j\}$ set is not properly treated in
the derivation of Eq. (4.2.16). The ratios of the critical temperature to
$\Delta_1$ for different $\varepsilon_{B0}/\Delta_1$ ratios are given in
Table 4.7.1.


Table 4.7.1

Critical temperature for different coordination numbers z and
$\varepsilon_{B0}/\Delta_1$ ratios.

  ε_B0/Δ_1    kT_c/Δ_1 (z = 4)    kT_c/Δ_1 (z = 12)
  --------------------------------------------------
   0           1.44                5.49
   5           6.90               20.8
  10          11.9                36.0
  15          16.9                51.3

As expected, as the ratio $\varepsilon_{B0}/\Delta_1$ increases, $T_c$
increases, because $\varepsilon_{B0}$ becomes the proper reference energy.

It is common to find ratios $\varepsilon_{B0}/\Delta_1 \cong 10$, so
theories that do not account properly for the x dependence of the energies
are inaccurate.
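The trend in Table 4.7.1 can be explored numerically. The sketch below
assumes that the pair-cluster critical condition takes the form
$\Delta_1/kT_c = -\ln[(z-2)/z + \varepsilon_{B0}/(2kT_c)]$, which is a
reading of Eq. (4.7.24) and should be checked against the original
equation. It solves for $t = kT_c/\Delta_1$ by bisection. The function name
and the search bracket are hypothetical.

```python
import math


def critical_temperature_ratio(z, energy_ratio):
    """Return t = kTc/Delta_1 for coordination number z.

    energy_ratio is eps_B0/Delta_1. The assumed condition is
    exp(-1/t) = (z - 2)/z + energy_ratio/(2 t), whose left side minus right
    side increases monotonically with t, so bisection finds the single root.
    """
    def f(t):
        return math.exp(-1.0 / t) - (z - 2.0) / z - energy_ratio / (2.0 * t)

    lo, hi = 1e-3, 1e4  # assumed bracket for t
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

For $\varepsilon_{B0} = 0$ the condition reduces to
$kT_c/\Delta_1 = 1/\ln[z/(z-2)]$, which gives about 1.44 for z = 4 and
about 5.49 for z = 12, the first row of Table 4.7.1. For nonzero ratios
the sketch gives values close to, though not identical with, the
tabulated ones.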


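The chemical potential $\mu_B$ for B atoms in the $A_{1-x}B_x$ or
$A_{1-x}B_xC$ alloy systems is defined as

$$\mu_B \equiv \frac{\partial\Delta F}{\partial N_B} = \frac{\partial\Delta F}{N\,\partial x} . \tag{4.7.25}$$

Then from Eq. (4.7.18), and using the approximations
$(\varepsilon_{A0} - \varepsilon_{B0})/kT \ll 1$ and
$(1/kT)\,\partial\Delta_j/\partial x \ll 1$, the leading terms are

$$\mu_B = \mathcal{M}(1-2x)\,\varepsilon_{B0} + n\mathcal{M}\,\mu_{BR} + kT\,(1 - n\mathcal{M})\ln(x/y) , \tag{4.7.26}$$

where $\mathcal{M} = M/N$ and the reduced chemical potential $\mu_{BR}$ is
related to the reduced activity coefficient $\eta$ by

$$\mu_{BR} = kT\ln\eta . \tag{4.7.27}$$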


The quantity μ_BR is the contribution to the chemical potential stemming
from the "state of order" of the alloy.

To gain some insight into the behavior of μ_BR, examine Eq. (4.7.15) in the
case where J = n and n_j = j. Expand Δ_j in a power series in j:

    \Delta_j = \sum_i \delta_i\, j^i .    (4.7.28)

As we will see in Sec. 4.9, it is often sufficient to retain terms in this
expansion to j^2. To obtain a simple analytical form that displays the
major physical trends of μ_BR, we will also replace j^2 by <j>j, where
<j> = nx is the mean value of j. Then Eqs. (4.7.28) and (4.7.15) become


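    nx = \frac{\sum_{j=0}^{n} j\, g_j \left[\eta\, e^{-(\delta_1 + nx\delta_2)/kT}\right]^{j}}
              {\sum_{j=0}^{n} g_j \left[\eta\, e^{-(\delta_1 + nx\delta_2)/kT}\right]^{j}}
       = \frac{n\,\eta\, e^{-(\delta_1 + nx\delta_2)/kT}}{1 + \eta\, e^{-(\delta_1 + nx\delta_2)/kT}}    (4.7.29)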


or 


    \mu_{BR} = (\delta_1 + nx\delta_2) + kT \ln(x/y) .    (4.7.30)
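The algebra connecting Eqs. (4.7.29) and (4.7.30) can be spelled out as
follows. This is only a sketch: it assumes that the degeneracies g_j are
the binomial coefficients and that y = 1 - x, neither of which is stated
in this passage, and the shorthand u and δ is introduced here purely for
convenience.

```latex
\begin{align*}
u &\equiv \eta\, e^{-\delta/kT}, \qquad \delta \equiv \delta_1 + n x \delta_2, \\
n x &= \frac{\sum_{j=0}^{n} j \binom{n}{j} u^{j}}{\sum_{j=0}^{n} \binom{n}{j} u^{j}}
     = \frac{n u\,(1+u)^{n-1}}{(1+u)^{n}} = \frac{n u}{1+u}, \\
u &= \frac{x}{1-x} \;\Longrightarrow\; \eta = \frac{x}{1-x}\, e^{\delta/kT}, \\
\mu_{BR} &= kT \ln \eta = (\delta_1 + n x \delta_2) + kT \ln\frac{x}{1-x} .
\end{align*}
```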


Therefore, μ_BR is of order Δ_j and has an entropy contribution that
dominates at high temperatures. This entropy contribution exactly
cancels the kT nζ ln(x/y) term in Eq. (4.7.26), which simplifies to

    \mu_B = (1-2x)\,\varepsilon_{BO} + n\zeta\,(\delta_1 + nx\delta_2) + kT \ln(x/y) .    (4.7.31)

The entropy contribution to the chemical potential is only that for a
random alloy. All modifications caused by GQCA have cancelled out. This
result is nearly exact in the limit where the temperature is high
enough that {Δ_j} << kT ln(x/y), because in this limit none of the
approximations used to obtain Eq. (4.7.29) are needed.


Note that the cluster distribution function in Eq. (4.7.12) could have been
written directly had we used a simple grand canonical ensemble for the
clusters from the outset. Then η is the absolute activity for the B atoms
in this ensemble. The whole derivation can also be done using a steepest
descents argument for the partition function (Sher et al., 1987). While
these methods are more elegant, the essential physics is the same as in the
GQCA derived in the straightforward formalism above. These calculations
can be summarized in four steps: (1) calculate the cluster energies ε_j(n)
and from them the reduced excess energies Δ_j defined in Eq. (4.7.4); (2)
solve the polynomial equation (4.7.14) to obtain η; (3) use η and the
energies to obtain the cluster distribution from Eq. (4.7.12), and use this
distribution to calculate all the statistically averaged properties of the
alloy; and (4) in particular, use the set {x_j} in Eq. (4.7.3) to obtain the
mixing enthalpy ΔE, and in Eq. (4.7.7) to obtain ΔS, which then enables
phase-diagram calculations and thermodynamic studies.
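Steps (2) and (3) above can be sketched numerically. The block below is a
minimal illustration, not the procedure actually used for the results in
this report. It assumes that the cluster distribution has the form
x_j proportional to C(n,j) η^j exp(-Δ_j/kT) and that the constraint
fixing η is sum_j j x_j = nx; both forms, the function name, and the
bisection bracket are assumptions made for this example.

```python
import math

def gqca_populations(x, deltas, kT):
    """Return (eta, [x_0, ..., x_n]) for clusters with n = len(deltas) - 1 alloy sites.

    Assumed forms (not quoted from the text):
      x_j proportional to C(n, j) * eta**j * exp(-deltas[j] / kT)
      sum_j j * x_j = n * x
    """
    if not 0.0 < x < 1.0:
        raise ValueError("x must lie strictly between 0 and 1")
    n = len(deltas) - 1
    # Step 1 is taken as given: deltas[j] are the reduced excess energies.
    log_w0 = [math.log(math.comb(n, j)) - deltas[j] / kT for j in range(n + 1)]

    def weights(log_eta):
        # Normalized cluster populations, evaluated in log space for stability.
        logs = [log_w0[j] + j * log_eta for j in range(n + 1)]
        top = max(logs)
        w = [math.exp(v - top) for v in logs]
        total = sum(w)
        return [v / total for v in w]

    def mean_j(log_eta):
        return sum(j * p for j, p in enumerate(weights(log_eta)))

    # Step 2: mean_j increases monotonically with ln(eta), so bisection works.
    # The bracket [-60, 60] is an arbitrary choice that covers moderate cases.
    lo, hi = -60.0, 60.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if mean_j(mid) < n * x:
            lo = mid
        else:
            hi = mid
    log_eta = 0.5 * (lo + hi)
    # Step 3: populations at the converged activity.
    return math.exp(log_eta), weights(log_eta)
```

With all Δ_j set to zero, the sketch returns the random (binomial)
populations and η = x/(1-x); negative Δ_j for the mixed clusters raise
their populations above the random values.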

4.8  Internal  Strain  and  Cluster  Energies 

Strictly  speaking,  QCA  and  GQCA  imply  short-ranged  interaction  energies. 
However,  the  energy  models  in  Sec.  4.6,  which  are  consistent  with  the 
experiments,  indicate  that  strain  is  the  most  important  contribution  to 
mixing energy in most pseudobinary alloys. The exceptions are the nearly
lattice-matched alloys, e.g., GaAlAs and HgCdTe. Since the
strain  energy  is  shared  among  bonds  throughout  the  crystal  and  is 
mutually  interactive  in  nature,  it  can  never  be  separated  into  isolated 
contributions.  To  write  the  mixing  enthalpy  as  a  sum  of  cluster  energies, 
these  cluster  energies  must  contain  contributions  extending  beyond  the 
cluster  size.  Such  an  effect  has  already  been  included  in  the  energy 
models  used  in  Section  4.6.  This  section  is  designed  to  provide  a  basis  for 
treating the internal strain energy in alloys (Chen et al., 1987) and
obtaining its contribution to cluster energies for insertion into the GQCA
calculation.

Consider an alloy divided into M nonoverlapping clusters. In general, the
alloy strain energy can be written as the sum of the strain energies of
these clusters and the interaction among the clusters. Focus on a
particular cluster denoted n, detach this cluster from the alloy, and allow
both the cluster and the remaining medium to relax. When the cluster is
reconnected to the medium, there will be a change in the energy. This
energy change, denoted h_nm, is defined as the interaction energy between
the cluster and its surrounding medium labeled by m. Thus the total strain
energy of the alloy can be written as

    E = \sum_n \left( \varepsilon_n + \tfrac{1}{2} h_{nm} \right) .    (4.8.1)

Here ε_n is a residual strain energy for the nth cluster when it is detached
from the alloy and is in its equilibrium configuration. The sum in Eq.
(4.8.1) is over all clusters in the alloy. The factor 1/2 is inserted to
eliminate double counting. In an alloy, Eq. (4.8.1) can be replaced by an
ensemble average,

    E = M \sum_n \sum_m \left( \varepsilon_n + \tfrac{1}{2} h_{nm} \right) p_n\, p_{nm} ,    (4.8.2)

where p_n is the probability that a cluster will be of the n type (e.g.,
specified by the number of A and B atoms and their arrangements) and p_nm
is a conditional probability that the surrounding environment is in state m
when the cluster is in state n. Note that in Eq. (4.8.1) n denotes the
location of the cluster, while in Eq. (4.8.2) n designates the state of a
cluster. From Eq. (4.8.2), an effective cluster energy ε(n) can be defined as

    \varepsilon(n) = \sum_m \left( \varepsilon_n + \tfrac{1}{2} h_{nm} \right) p_{nm} .    (4.8.3)

These energies can be expressed explicitly within the following elastic
continuum model.

Consider a spherical cluster embedded in an elastic medium. The
interaction energy can be written as

    h_{nm} = \tfrac{1}{2}\mu_n (a_n - r)^2 + \tfrac{1}{2}\nu_m (b_m - r)^2 ,    (4.8.4)

where the equilibrium radius r is to be determined by minimizing h_nm. The
lengths a_n and b_m are the "natural" radii, and μ_n and ν_m are the force
constants of the cluster and the medium, respectively. For a macroscopic
semiconductor cluster of radius R, μ = 12πBR, and a good approximation for
ν is ν ≈ 3.2π(C_11 - C_12 + 3C_44) (Chen and Sher, 1985). The value of r
corresponding to the minimum value of h_nm is given by

    r_{nm} = (\mu_n a_n + \nu_m b_m)/(\mu_n + \nu_m) ,    (4.8.5)

and the actual interaction energy becomes

    h_{nm} = \tfrac{1}{2}\mu_n \nu_m (a_n - b_m)^2/(\mu_n + \nu_m) .    (4.8.6)
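The minimization leading from Eq. (4.8.4) to Eqs. (4.8.5) and (4.8.6) is
easy to check numerically. The following sketch uses hypothetical function
names and arbitrary illustrative numbers; it only restates the two-spring
algebra and makes no claim about realistic force constants.

```python
def interaction_energy(r, mu_n, a_n, nu_m, b_m):
    """Eq. (4.8.4): two harmonic springs sharing the common radius r."""
    return 0.5 * mu_n * (a_n - r) ** 2 + 0.5 * nu_m * (b_m - r) ** 2

def relaxed_radius(mu_n, a_n, nu_m, b_m):
    """Eq. (4.8.5): the radius that minimizes the interaction energy."""
    return (mu_n * a_n + nu_m * b_m) / (mu_n + nu_m)

def relaxed_energy(mu_n, a_n, nu_m, b_m):
    """Eq. (4.8.6): the interaction energy at the relaxed radius."""
    return 0.5 * mu_n * nu_m * (a_n - b_m) ** 2 / (mu_n + nu_m)

# Arbitrary illustrative values (force constants and lengths in any consistent units).
mu_n, a_n, nu_m, b_m = 6.0, 2.62, 2.0, 2.45
r_min = relaxed_radius(mu_n, a_n, nu_m, b_m)
h_min = relaxed_energy(mu_n, a_n, nu_m, b_m)
```

The stiffer of the two springs pulls the relaxed radius toward its own
natural radius, and the energy depends only on the mismatch a_n - b_m
through the series combination μ_n ν_m/(μ_n + ν_m).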

Next cast the average energy into a form in which a_n and b_m are decoupled.
To do this, an effective radius r_e is introduced such that

    \sum_n \sum_m p_n\, p_{nm}\, \mu_n \nu_m (a_n - b_m)^2 / (\mu_n + \nu_m)
        = \sum_n \sum_m p_n\, p_{nm}\, \mu_n \nu_m \left[ (a_n - r_e)^2 + (b_m - r_e)^2 \right] / (\mu_n + \nu_m) ,    (4.8.7)

which imposes the following condition on r_e:

    \sum_n \sum_m p_n\, p_{nm}\, \mu_n \nu_m (a_n - r_e)(b_m - r_e) / (\mu_n + \nu_m) = 0 .    (4.8.8)


A very quick and important result for the internal strain energy can be
obtained from these two equations by neglecting the correlation between
p_n and p_nm and approximating all the force constants by the
concentration-weighted average values μ and ν. This result is:
r_e = <a_n> = <b_m>, <(a_n - r_e)^2> = <(b_m - r_e)^2> = <Δr^2>, and

    E = M \left[ \varepsilon + \tfrac{1}{2}\mu\nu \langle \Delta r^2 \rangle / (\mu + \nu) \right] ,    (4.8.9)

where ε is the average value <ε_n>.
Eq. (4.8.9) shows that there are three essential features contributing to
the internal strain energy in this cluster theory: the mean square
fluctuation of the cluster radii <Δr^2>, the effective force constant
μν/(μ + ν), and the mean strain energy ε of isolated clusters.

This result further suggests a way to use the effective medium to obtain
cluster energies in an alloy. For example, if one assumes that the
effective medium has an effective elastic force constant ν and radius r_e,
then replacing the medium by a cluster n introduces a strain energy having
the same expression as Eq. (4.8.6) with ν_m replaced by ν and b_m by r_e. If
all the strain energy is assigned to this cluster, we obtain an effective
cluster energy,

    \varepsilon_c(n) = \varepsilon_n + \tfrac{1}{2}\mu_n \nu (a_n - r_e)^2 / (\mu_n + \nu) .    (4.8.10)

We can see that the alloy mean energy E = M Σ_n p_n ε_c(n) reduces to the
result in Eq. (4.8.9) when μ_n is taken to be the mean value μ. The
effective medium nature enters if we take r_e to be determined by the
condition that the average energy E be a minimum, i.e.,

    \sum_n p_n\, \mu_n \nu (a_n - r_e) / (\mu_n + \nu) = 0 .    (4.8.11)

Application of this self-consistent approach to the elastic constants has
already been considered in Section 3.7.
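Because Eq. (4.8.11) is linear in r_e, the self-consistent radius and the
cluster energies of Eq. (4.8.10) can be written down in closed form. The
sketch below illustrates this; the function name, argument layout, and
numerical inputs are assumptions made for the example and are not taken
from the report.

```python
def effective_medium(p, mu, a, eps, nu):
    """Effective radius and cluster energies from Eqs. (4.8.10) and (4.8.11).

    p   : cluster probabilities p_n (summing to 1)
    mu  : cluster force constants mu_n
    a   : natural cluster radii a_n
    eps : residual strain energies eps_n of the detached clusters
    nu  : force constant of the effective medium
    Returns (r_e, [eps_c(n)], mean energy per cluster E/M).
    """
    # Series combination of the cluster spring and the medium spring.
    k = [m * nu / (m + nu) for m in mu]
    # Eq. (4.8.11) solved for r_e: a weighted mean of the natural radii.
    r_e = sum(pn * kn * an for pn, kn, an in zip(p, k, a)) / sum(
        pn * kn for pn, kn in zip(p, k))
    # Eq. (4.8.10): strain energy assigned to each cluster type.
    e_c = [en + 0.5 * kn * (an - r_e) ** 2 for en, kn, an in zip(eps, k, a)]
    # Mean energy per cluster.
    e_mean = sum(pn * ec for pn, ec in zip(p, e_c))
    return r_e, e_c, e_mean
```

When all the μ_n are equal, r_e reduces to the plain average <a_n> and the
mean energy per cluster reduces to the bracket in Eq. (4.8.9).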

Let us now apply the general ideas behind Eqs. (4.8.10) and (4.8.11) for the
cluster energies, together with GQCA, to a systematic discussion of the
statistical properties of pseudobinary alloys A_{1-x}B_xC. Because of the
special atomic arrangements of these alloys, the bonds are the natural
entities to use for the energy counting, while it is more convenient to
count the atoms for the statistical arrangements. All the bonds in these
alloys can be clearly separated into AC and BC bonds. A convenient way to
generate the cluster is to assign the four AC bonds surrounding an A atom
to that atom, and similarly assign the four BC bonds to the central B atom.
In this way the smallest cluster is a one-atom, four-bond cluster.
Following the ideas behind Eq. (4.8.10), we start with an effective medium,
replace the center atom by an A atom and the four surrounding bonds by AC
bonds, and calculate the interaction energy to obtain the one-atom cluster
energy ε_A. In a similar way we can obtain ε_B by replacing the cluster with
a B atom and its four BC bonds. Note that the mixing energy ΔE in Eq.
(4.7.3) is not zero, because ε_A and ε_B in an effective medium are different
from their respective values in their respective pure constituent crystals.

If the effective spring model of Figure 3.7.3 is used, then μ = 6α_A, ν = 2α,
and we have [following Eqs. (4.8.10) and (4.7.4)]

where δ_0 is (d_AC - d_BC) with d being the VCA bond length. A similar
expression can be obtained for Δ_B. When d and all the α's in Δ_A and Δ_B are
replaced by their mean values, the resulting ΔE is only modified by a few
percent from that based on the actual expressions in Eq. (4.8.12). The
resulting  mixing  enthalpy  is  given  by  a  simple  expression  AE  ■  x  (1-x) 

3^d2/2.  This  is  the  same  result  as  that  given  in  Eq.  (4.6.13)  for  the 
mixing enthalpy parameter. Since each cluster contains only one alloying
atom and none of the clusters share the same alloying atom, the total
number of possible ways to arrange the clusters is exactly Φ_0 in Eq.
(4.2.1), so that the total mixing free energy in this model is precisely
Stringfellow's preferred form given in Eq. (4.6.2).

The next simplest cluster set for the pseudobinary alloy consists of
clusters that contain four alloy atoms and sixteen bonds, as depicted in
Figure 4.8.1. This is the smallest cluster size that can provide a
meaningful description of the local statistical correlations among the
alloying atoms. Since the results for this sixteen-bond cluster have
important implications for many properties, they warrant a detailed
discussion, which is given in the next section.

Figure 4.8.1 Schematic picture of a 4-atom, 16-bond cluster in a
pseudobinary alloy. This figure shows an A2B2 cluster in a cubic cage.

4.9  Sixteen-bond  Microclusters 

As mentioned earlier, the 16-bond clusters shown in Figure 4.8.1 are the
smallest non-overlapping clusters that are useful for examining local
statistical correlations of alloy atoms in a pseudobinary alloy A_{1-x}B_xC.
Each cluster contains four alloy atoms, A or B, on the vertices of a
tetrahedron. The 16 bonds in a cluster include the four bonds connecting
the four alloy atoms to the central C atom, and the twelve bonds
connecting the four alloy atoms to twelve C atoms on the periphery. In the
absence of external stresses, there are five different kinds of clusters
distinguished by distinct cluster energies. These clusters can be labeled
according to the numbers of A and B atoms in the cluster, namely A4B0,
A3B1, A2B2, A1B3, and A0B4, and are indexed by j = 0 to J with J = n = 4,
following the notation in Section 4.7. Each cluster now occupies one cube
in the crystal. Thus the ratio of the number of clusters M to the number of
unit cells (or C atoms) N is M/N = 1/4. Since each cluster shares no alloy
atoms with any other cluster, the total number of alloy configurations Φ
in Eq. (4.7.5) for a given set of numbers of clusters {M_j} is exact. Another
consequence, which follows from the fact that nζ = 1, as can be seen from
Eq. (4.7.16), is that some entropy terms in the free energy vanish. Any
error lies in approximating the Hamiltonian as a sum of independent
cluster contributions.

The effective cluster energies can be calculated in a manner similar to
what was done for the one-atom clusters in Sec. 4.8. To circumvent the
complicated atomic relaxation that can occur in the medium outside a
cluster, the effective spring model results deduced in the single impurity
case can be extended to the present case. To be more explicit, we start
with a VCA medium and replace sixteen bonds in this medium by a cluster.
The atoms in the cluster, including the twelve peripheral C atoms, are
allowed to relax, but the atoms in the medium are fixed at their VCA
sites. To compensate for the imposed rigidity, the bond-angle restoring
force β in VFF is taken to be zero for all interactions involving those
bonds connecting the peripheral C atoms to the medium. With this
simplification the strain energies and atomic positions can be treated
without further approximations in VFF through a minimization of the
strain energy. These strain energies, according to Eq. (4.8.10), can be
assigned as the strain contribution to the cluster energy.

Contributions to the cluster energies from modifications of the chemical
bonds in the alloy can be estimated using the bond-orbital model (BOM) of
Sec. 2.5 by calculating the changes in the metallization energy of each
bond, i.e., Δε_b in Eq. (2.5.7), from its pure crystal value (Chen and Sher,
1985). The contributions to Δε_b due to antibonding states |a'> coming from
the same cluster can be treated exactly, while those from the other
clusters are approximated as statistically averaged values in the manner
indicated in Eq. (4.8.3).

Figure 4.9.1 shows the calculated cluster excess energies as a function of
x in Ga_{1-x}In_xAs. For a small x value, the ε_j energies are larger for
larger j, because a larger j means there are more InAs bonds in the cluster,
and therefore it is harder to fit it into a medium with a lattice constant
close to that of a GaAs crystal. Similarly, at x = 1/2, ε_2 for the A2B2
cluster is the smallest energy because this cluster has the best match to
the 50-50 VCA alloy lattice. Clearly, these energies are dominated by
strain. However, it is often misleading to judge the local correlation from
these energies. For this purpose it is better to look at the reduced cluster
energies Δ_j defined in Eq. (4.7.4), because it is these energies that drive
the cluster populations given by Eq. (4.7.12).

Figure 4.9.2 shows the reduced cluster energies Δ_j for Ga_{1-x}In_xAs. By
definition Δ_0 and Δ_4 are zero. The other Δ_j's differ drastically from the
ε_j set, not only in magnitude but also in their x dependence. These reduced
cluster energies are all negative and have a weak linear dependence on x.
The three lines are nearly parallel to each other, indicating the same x
dependence. Δ_1 and Δ_3 are nearly equal and are less negative than Δ_2. The
negative values of the Δ_j qualitatively imply that A and B atoms do not
like to segregate. These general features of cluster energies are found to
be common in all III-V and II-VI alloys with appreciable (few percent)
lattice mismatches. It is evident from Fig. 4.9.2 that δ_1 and δ_2 in Eq.
(4.7.28) have opposite signs and are roughly related by δ_1 ≈ -nδ_2 < 0, so
Δ_j = -δ_2 j(4-j) = Δ_2 j(4-j)/4. Then the energy contribution to μ_B in Eq.
(4.7.31) from the state of order is nζ(δ_1 + nxδ_2) = Δ_2(1-x). When x is zero
there is a small negative enthalpy contribution to the chemical potential,
i.e., to the free energy when a B atom is added, but when x is unity the
state of order makes no enthalpy contribution to the free energy when an
extra B atom is added. For the lattice-matched alloys such as Ga_{1-x}Al_xAs
and Hg_{1-x}Cd_xTe, all the energies are small and the effects caused by
{Δ_j} are negligible.

Figure 4.9.1 Cluster excess energies of the 16-bond clusters
described in the text in Ga_{1-x}In_xAs for clusters with different
numbers of In atoms.

Figure 4.9.2 The corresponding reduced cluster energies Δ_j for
Ga_{1-x}In_xAs.
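The relations quoted in the discussion of Fig. 4.9.2 follow from the
two-term form of Eq. (4.7.28) in a few lines. In this sketch ζ is used as a
label for the ratio M/N, so that nζ = 1 for the 16-bond clusters; that
choice of symbol is an assumption of this example.

```latex
\begin{align*}
\Delta_j &= \delta_1 j + \delta_2 j^2, \qquad \delta_1 = -n\delta_2,\; n = 4
  \;\Longrightarrow\; \Delta_j = -\delta_2\, j\,(4 - j), \\
\Delta_2 &= -4\delta_2 \;\Longrightarrow\; \delta_2 = -\tfrac{1}{4}\Delta_2,\quad
  \delta_1 = \Delta_2,\quad \Delta_j = \tfrac{1}{4}\Delta_2\, j\,(4 - j), \\
\Delta_1 &= \Delta_3 = \tfrac{3}{4}\Delta_2, \\
n\zeta\,(\delta_1 + n x \delta_2) &= \Delta_2 - x\,\Delta_2 = \Delta_2\,(1 - x)
  \qquad (n\zeta = 1).
\end{align*}
```

With Δ_2 < 0 this reproduces the stated ordering: Δ_1 and Δ_3 are equal and
less negative than Δ_2, and the enthalpy contribution vanishes at x = 1.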

The best way to measure the local correlation is to compare the average
values of the cluster populations x_j = M_j/M with those in a random alloy,
x_j^0, given in Eq. (4.7.6) with g_j given in Eq. (4.7.15). The average
cluster populations can be calculated from Eq. (4.7.12) after η is
determined by solving Eq. (4.7.14). Figure 4.9.3 shows a plot of μ_BR,
determined from η = exp(μ_BR/kT), for several temperatures as a function
of x. These curves were determined numerically and have none of the
approximations inherent in Eq. (4.7.28). The shapes of the curves are just
what would be expected from the trends predicted by Eq. (4.7.28). The
reduced chemical potential for B atoms, μ_BR, is large and negative for
small x and large and positive for large x because of the entropy terms,
and small near the center of the x range where the Δ_j's are influential.
Figures 4.9.4a and b show plots of the deviations from randomness,
x_j - x_j^0, for different clusters as a function of alloy concentration x
at equilibration temperatures of 600 K and 1500 K, respectively. These two
sets of curves are similar in shape but different in magnitude. As expected
from the fact that Δ_1, Δ_2, Δ_3 < 0 and Δ_0 = Δ_4 = 0, the j = 1, 2, 3
cluster populations, where they are large, are seen to be enhanced by the
Δ_j, while the j = 0 and 4 populations are reduced. For reference, the
random populations {x_j^0} as a function of concentration are shown in
Fig. 4.9.4c. These cluster populations are seen to be sensitive to
temperature. For example, at x = 0.54, the deviation decreases from 25% to
5.5% as T increases from 600 to 1500 K. For a given sample, the actual
temperature to be used to calculate the cluster population depends on the
"last" temperature T_f at which the alloy atoms cease to diffuse. If T_f is
larger than the sample temperature T, then the alloy is in a metastable
state, and T_f instead of T should be used.

After the cluster distribution is calculated, the alloy mixing enthalpy is
determined from Eq. (4.7.3). Figure 4.9.5 shows ΔE as a function of x for
Ga_{1-x}In_xAs at four temperatures: T = 300, 600, 1000, and 1500 K. ΔE is
seen to be dominated by the T-independent part given by the first two
terms on the right side of Eq. (4.7.3), which can be attributed to the strain
energy. The T dependence is seen to be very weak, because the Δ_j's are
small. ΔE increases with T because a higher temperature increases the
population of clusters with higher energies. The curves in Fig. 4.9.5
indicate that the approximation ΔE = x(1-x)Ω with a constant positive Ω,
such as that used in Sec. 4.7, should not introduce much error in the free
energy calculation. However, a positive value of Ω does not necessarily
imply that A and B atoms repel each other microscopically. This is evident
from Figure 4.9.4, which shows enhanced population for the A2B2 clusters
relative to those of a random alloy.

Figure 4.9.3 Reduced chemical potential μ_BR as a function of
concentration x for several temperatures T = 300, 600, 1500 K in the
alloy Ga_{1-x}In_xAs.

Figure 4.9.4 Deviations of cluster populations in Ga_{1-x}In_xAs from
random distribution at (a) T = 600 K and (b) 1500 K. For reference the
random populations {x_j^0} are plotted against x in (c).

Figure 4.9.5 Mixing enthalpies of Ga_{1-x}In_xAs as a function of x at four
temperatures.

The total mixing free energy, ΔF, can then be calculated from ΔE - TΔS
with ΔS given by Eq. (4.7.7). The results for Ga_{1-x}In_xAs are shown in
Figure 4.9.6. These curves show that this alloy has a miscibility gap at
low temperature. These results are similar to those based on
pair-potential QCA. Therefore, it is not surprising that replacement of the
pair-QCA ΔF for pseudobinary solid solutions by the present GQCA results
produces nearly the same phase diagrams as those shown in Fig. 4.5.4
(Patrick et al., 1987). This also shows that the phase diagrams are in the
class of phenomena that are rather insensitive to details of the
short-range-order statistical distributions. Effects sensitive to their
local environment, e.g., vacancy formation, are more sensitive to the
short-range-order state than those, like phase diagrams, that are related
to average properties.
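The appearance of a miscibility gap at low temperature can be illustrated
with the simplest closed-form stand-in for ΔF: the constant-Ω form
ΔE = x(1-x)Ω combined with the random-alloy mixing entropy. This is a
regular-solution sketch, not the GQCA calculation behind Figure 4.9.6.
Energies are taken per alloy atom, and the function names and the units
(Ω = 1) are assumptions made for the example.

```python
import math

def mixing_free_energy(x, omega, kT):
    """Regular-solution mixing free energy per alloy atom (a stand-in for GQCA)."""
    entropy_term = x * math.log(x) + (1.0 - x) * math.log(1.0 - x)
    return omega * x * (1.0 - x) + kT * entropy_term

def curvature(x, omega, kT):
    """Second derivative of the mixing free energy with respect to x."""
    return -2.0 * omega + kT / (x * (1.0 - x))

def critical_kT(omega):
    """Temperature (times k) at which the curvature at x = 1/2 changes sign."""
    return omega / 2.0

# Below kT = omega / 2 the free energy is concave near x = 1/2, which signals
# a miscibility gap; above it the curve is convex everywhere.
omega = 1.0
below = curvature(0.5, omega, 0.4)
above = curvature(0.5, omega, 0.6)
```

In this simplified picture the gap closes at kT_c = Ω/2, which is why a
positive mixing enthalpy parameter by itself is enough to produce phase
separation at sufficiently low temperature.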

A byproduct of these calculations is the bond length distribution in alloys.
Figure 4.9.7a shows the bond lengths from the central C atoms to the four
alloy atoms as a function of x for all five clusters considered. The
spreads in the lengths of the same kind of bond in different clusters are
found to be small. Figure 4.9.7b shows their average values, and the rms
widths about the mean, of the Ga-As and In-As bonds over the cluster
populations. The calculated result clearly supports the bimodal bond
length distribution revealed by EXAFS.

Besides Ga_{1-x}In_xAs, the above studies have been carried out for several
other pseudobinary alloys, including Ga_{1-x}In_xP, GaAs_{1-x}Sb_x,
Hg_{1-x}Cd_xTe, Hg_{1-x}Zn_xTe, Cd_{1-x}Zn_xTe and ZnSe_{1-x}Te_x (Patrick
et al., 1987 and 1988). Among quantities that can be checked accurately
against experiment, the bond lengths and the phase diagrams from the
16-bond cluster, generalized QCA calculations compare favorably with
experiments. As further evidence, Fig. 4.9.8 shows the liquidus-solidus
curves for three II-VI pseudobinary alloys based on the 16-bond GQCA
calculations for the solid phases (Patrick et al., 1988). Agreement between
the predictions and the experiments is very good. The effective mixing
enthalpy parameters for completely disordered alloys, i.e., random alloys,
derived from the present 16-bond model for a number of systems are
listed in Table 4.6.2. These are seen to be in accord with the experimental
values and with those from previous estimates. However, the attractive
interactions between the alloy atoms, and the associated changes in the
cluster populations found from the calculations, have not been clearly
demonstrated experimentally.

Figure 4.9.8 Calculated liquidus-solidus curves for three II-VI
pseudobinary alloys using the 16-bond GQCA results for the solid
phase and their comparison with experiments. After Patrick et al.
(1988).

4.10  Cluster  Variational  Method 

The  mixing  energy  models  and  statistical  methods  discussed  so  far  have 
provided  an  adequate  description  of  local  correlation  and  phase  diagrams 
in  pseudobinary  semiconductor  alloys.  All  the  results  in  these  models 
were  obtained  without  relying  on  heavy  computations.  However,  modern 
computers  do  enable  sophisticated  calculations,  and  more  rigorous 
calculations  should  be  considered.  This  section  will  discuss  an  improved 
statistical  theory,  and  the  next  section  will  review  recent  efforts  to 
combine  this  statistical  method  with  first-principles  LDA  calculations 
for  semiconductor  pseudobinary  alloys. 

A statistical method first proposed by Kikuchi (1951), which improves on
GQCA by incorporating some statistical correlation between clusters, is
referred to as the cluster variational method (CVM). Like GQCA, CVM
divides a crystal into clusters, which are chosen to contain all the
important interactions. There are several approaches to obtaining the CVM
cluster configurations. The method we shall consider is perhaps the
simplest one. Unless otherwise specified, we shall ignore the spectator C
atoms in the pseudobinary A_{1-x}B_xC alloy, so that the statistical
counting is the same as that in the binary A_{1-x}B_x. We will use a
familiar notation. There are N lattice sites, equal to the number of atoms
N = N_A + N_B. M is the number of basic clusters in the crystal. For
example, if the clusters are pairs, then M = zN/2, where z is the
coordination number. If the basic cluster is a square, then M = N in a
square lattice, but M = 3N in a simple cubic lattice. The quantity of
interest is the number of ways, Φ, of arranging the A and B atoms on the
lattice for a specified set of the numbers of different types of clusters
{M_j}. If these clusters were all independent, then the answer would be

    g(M,s) = M! \Big/ \prod_j M_j! ,    (4.10.1)

where  the  index  s  labels  the  basis  cluster  being  used,  e.g.  pair,  triangle, 
square,  etc.  However,  these  basis  clusters  in  a  crystal  share  subclusters, 
e.g.,  squares  may  share  pairs.  Therefore,  the  counting  g(M,s)  has  to  be 
modified.  The  subclusters  can  also  share  further  subclusters,  and  further 
corrections  have  to  be  made  until  reaching  the  final  smallest  subclusters 
-  the  single-atom  clusters  to  be  referred  to  as  points.  The  ways 
systematic  corrections  are  made  can  be  best  understood  by  going  through 
several  examples. 

Consider the simplest case in which the basis clusters are pairs. Then
M = zN/2, and the uncorrected Φ is given by g(M,pair) in Eq. (4.10.1). These
pairs share points. The M independent pairs would contain 2M points,
whereas there are only N points in the crystal. A logical way to correct
g(M,pair) is to normalize it with a ratio of point arrangements as follows:

    \Phi = g(M,\mathrm{pair}) \left[ g(N,\mathrm{point}) / g(2M,\mathrm{point}) \right] ,    (4.10.2)

where g(N,point), following the definition of Eq. (4.10.1), is the number of
random configurations Φ_0 in Eq. (4.2.1)*. Using the Stirling
approximation ln N! ≈ N ln N - N, the mixing entropy ΔS = k ln Φ becomes

    \Delta S = -Nk \sum_j c_j \ln c_j - Mk \left[ \sum_j h_j y_j \ln y_j - 2 \sum_j c_j \ln c_j \right] ,    (4.10.3)

where for later convenience the notation c_1 and c_2 stands for the
fractions x and (1-x), respectively, y_j = M_j/M is the pair probability of
cluster type j, and h_j is the degeneracy of cluster type j. Note that in
the above notation, M_j is the number of clusters of the type identified by
specifying the arrangement of A and B atoms in the cluster; it does not
include other types with the same cluster energy. Thus h_j y_j is equivalent
to the cluster population x_j in QCA. With this understanding, the CVM is
exactly the same as QCA in the pair approximation.

*In terms of the logic used to get Eq. (4.7.5), Φ equals the number of
arrangements of the N_A and N_B atoms on the N sites, g(N,point), times the
fraction of those arrangements, g(M,pair)/g(2M,point), corresponding to the
pair arrangements.
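The pair-approximation entropy of Eq. (4.10.3) can be evaluated directly
for a binary alloy. The sketch below is illustrative only: it assumes three
energy-distinct pair types (BB, AB with degeneracy 2, and AA), takes
c_1 = x as the B fraction, and parametrizes the pair probabilities by the
probability y_ab of one ordered unlike pair; the function names are
hypothetical.

```python
import math

def xlogx(p):
    """p * ln(p), with the limiting value 0 at p = 0."""
    return p * math.log(p) if p > 0.0 else 0.0

def pair_cvm_entropy(x, y_ab, z):
    """Mixing entropy per site in units of k, Eq. (4.10.3) with M = z N / 2.

    x    : fraction of B atoms (c_1 = x, c_2 = 1 - x)
    y_ab : probability of one ordered unlike pair (degeneracy h = 2)
    z    : coordination number
    """
    # The like-pair probabilities follow from the point fractions.
    y_bb = x - y_ab
    y_aa = 1.0 - x - y_ab
    point_sum = xlogx(x) + xlogx(1.0 - x)
    pair_sum = xlogx(y_bb) + 2.0 * xlogx(y_ab) + xlogx(y_aa)
    m_over_n = z / 2.0
    return -point_sum - m_over_n * (pair_sum - 2.0 * point_sum)

def random_entropy(x):
    """Random-alloy mixing entropy per site in units of k."""
    return -(xlogx(x) + xlogx(1.0 - x))
```

For uncorrelated pairs, y_ab = x(1-x), the bracket in Eq. (4.10.3)
vanishes and the random-alloy entropy is recovered; any other value of
y_ab lowers the entropy, which is how short-range order enters the pair
approximation.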


Next consider square clusters on a square lattice. Let M now be the total
number of squares in the crystal; then M = N. The number of pairs in the
crystal is actually L = 2N. If all the M squares were independent, e.g.,
separated from one another, then the total number of pairs would be 4M
and the total number of atoms (points) would also be 4M. The first
correction to g(M,square) comes from the following pair normalization
factor:

    g(L,\mathrm{pair}) / g(4M,\mathrm{pair}) .    (4.10.4)


However, both the denominator and numerator of this factor have to be
further corrected with point normalizations in a form similar to that
inside the bracket of Eq. (4.10.2). Explicitly, the corrected Φ takes the
form

    \Phi = g(M,\mathrm{square})
           \left[ \frac{g(L,\mathrm{pair})}{g(4M,\mathrm{pair})} \right]
           \left[ \frac{g(8M,\mathrm{point})}{g(4M,\mathrm{point})} \right]
           \left[ \frac{g(2L,\mathrm{point})}{g(N,\mathrm{point})} \right]^{-1} .    (4.10.5)


In this equation the first bracket is the pair normalization, the second
bracket is the point normalization to correct g(4M,pair), and the last
bracket is to correct g(L,pair). The associated mixing entropy becomes

    \Delta S = -k \left[ (8M - 4M - 2L + N) \sum_j c_j \ln c_j + (L - 4M) \sum_j h_j y_j \ln y_j + M \sum_j g_j z_j \ln z_j \right]
             = -kN \left( \sum_j c_j \ln c_j - 2 \sum_j h_j y_j \ln y_j + \sum_j g_j z_j \ln z_j \right) ,    (4.10.6)

where y_j and z_j are respectively the pair and square probabilities, and h_j
and g_j are their respective degeneracies.

Now it becomes straightforward to write down the expression for ΔS for
square clusters in a (3-dimensional) simple cubic lattice, for which
M = 3N, L = 3N, and the expression for Φ in Eq. (4.10.5) is still valid, so

    \Delta S = -kN \left( 7 \sum_j c_j \ln c_j - 9 \sum_j h_j y_j \ln y_j + 3 \sum_j g_j z_j \ln z_j \right) .    (4.10.7)
As a final example, consider the tetrahedron clusters in an fcc lattice.
There are a total of M = 2N tetrahedral clusters and L = 6N pairs in the
crystal, and there are 6 sides and 4 atoms in a tetrahedron. Eq. (4.10.5)
has to be modified slightly for the present case to read

$$\Phi = g(M,\mathrm{tetrahedron})
\left[\frac{g(L,\mathrm{pair})}{g(6M,\mathrm{pair})}\right]
\left[\frac{g(12M,\mathrm{point})}{g(4M,\mathrm{point})}\right]
\left[\frac{g(2L,\mathrm{point})}{g(N,\mathrm{point})}\right]^{-1}.
\tag{4.10.8}$$

The entropy is then given by

$$\Delta S = -kN\Big(5\sum_j c_j \ln c_j - 6\sum_j h_j y_j \ln y_j
+ 2\sum_j g_j z_j \ln z_j\Big). \tag{4.10.9}$$

Up to now we did not have to distinguish between binary and pseudobinary
alloys. However, for the tetrahedron case there are two different kinds of
tetrahedra in pseudobinaries: one contains a central C atom, and the other
is empty. If the energies are assumed to be associated only with bonds,
then we can ignore those tetrahedra without a central C atom. Then M = N
should be used in Eq. (4.10.8) to give

$$\Delta S = -kN\Big(-3\sum_j c_j \ln c_j + \sum_j g_j z_j \ln z_j\Big).
\tag{4.10.10}$$

Note that the pair contribution disappears from this expression. This is
understandable, because the clusters treated this way share only points.
It is important to point out that Eq. (4.10.10) is exactly the same as the
GQCA result in Eq. (4.3.7) for the case of tetrahedra with the
identification of x_j with g_j z_j. This example demonstrates that alloy
statistics for pseudobinaries are not necessarily identical to those for
binaries.

It  is  useful  to  observe  a  rule  that  governs  the  coefficients  in  front  of  the 
summation signs inside the brackets for all CVM expressions for ΔS: If we
multiply  each  coefficient  by  the  number  of  atoms  in  the  cluster  that  term 
represents,  and  sum  up  these  products,  the  result  is  one.  This  rule  is 
useful  for  checking  if  the  configurations  are  properly  normalized. 
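
The bookkeeping behind this rule can be checked numerically. The following
is a minimal Python sketch, not part of the original derivation; the
function and argument names are illustrative assumptions. It builds the
point, pair and cluster coefficients from the pattern of Eqs. (4.10.5) and
(4.10.8), using M/N, L/N and the numbers of pairs and points per cluster,
and then evaluates the normalization rule for the four cases treated above.

```python
def cvm_entropy_coefficients(m_per_site, l_per_site,
                             pairs_per_cluster, points_per_cluster):
    """Coefficients (a_point, a_pair, a_cluster) in
    dS = -k N (a_point*sum c ln c + a_pair*sum h y ln y + a_cluster*sum g z ln z),
    following the bracket structure of Eqs. (4.10.5) and (4.10.8).
    m_per_site = M/N and l_per_site = L/N."""
    m, l = m_per_site, l_per_site
    a_cluster = m
    a_pair = l - pairs_per_cluster * m
    a_point = (2 * pairs_per_cluster * m      # points of the independent pairs
               - points_per_cluster * m       # points of the independent clusters
               - 2 * l + 1)                   # correction of g(L,pair)
    return a_point, a_pair, a_cluster


def normalization_sum(coeffs, points_per_cluster):
    """Sum of coefficient times number of atoms in the corresponding cluster."""
    a_point, a_pair, a_cluster = coeffs
    return a_point * 1 + a_pair * 2 + a_cluster * points_per_cluster


# (M/N, L/N, pairs per cluster, points per cluster)
CASES = {
    "square lattice, squares":      (1, 2, 4, 4),   # Eq. (4.10.6)
    "simple cubic, squares":        (3, 3, 4, 4),   # Eq. (4.10.7)
    "fcc, tetrahedra":              (2, 6, 6, 4),   # Eq. (4.10.9)
    "pseudobinary fcc, tetrahedra": (1, 6, 6, 4),   # Eq. (4.10.10)
}

for name, (m, l, n_pairs, n_points) in CASES.items():
    coeffs = cvm_entropy_coefficients(m, l, n_pairs, n_points)
    print(name, coeffs, normalization_sum(coeffs, n_points))
```

Within this pattern the sum equals one identically, whatever M and L are,
so a value other than one signals a miscounted bracket rather than a wrong
lattice constant.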

The equilibrium values of cluster probabilities such as {y_j} and {z_j} are
determined by minimization of the mixing free energy in a manner
similar to GQCA. Likewise, not all cluster probabilities are independent.
Since there are more variables to be determined and more constraint
conditions to be satisfied in CVM than in GQCA, Kikuchi (1981) worked
with the grand potential $\phi = F - \sum_j \mu_j N_j$ and designed an
iterative procedure, the so-called "natural iteration method", to attack
the problem.

Impressive results have been obtained from CVM with small clusters. For
example, Table 4.2.1 shows that CVM predicts critical temperatures very
well for the square lattice using square clusters, and for the fcc lattice
using tetrahedron clusters. There are also examples where CVM obtains
correct phase diagrams, whereas the Bragg-Williams approximation fails
quantitatively (Kikuchi, 1974). This method has recently been
incorporated in phase-diagram studies of semiconductor alloys using
energies derived from ab-initio LDA calculations. The results will be
discussed in the next section.

4.11 Ab-Initio Calculations

Because self-consistent LDA has been successful in predicting the
structural properties of ordered semiconductors, it is desirable to attempt
to apply this ab-initio theory to alloys. There are several challenges in
such an undertaking. First, the mixing enthalpies in semiconductor alloys
are typically of the order of 1 kcal per mole, or 1/23 eV per atom, but the
total energies, i.e., each term on the right side of Eq. (4.7.2), are many
orders of magnitude larger. Very precise and accurate numerical
calculations are required to get meaningful results. However, if carried
out precisely, LDA has been demonstrated to be able to predict changes in
energies correctly, despite its larger error in predicting total energies.
The next, and more challenging, difficulty is alloy disorder. To compute
the total energy of an alloy quantum mechanically one needs to solve the
Schrödinger equation for a Hamiltonian which does not have the lattice
translational symmetry that is so indispensable in traditional
band-structure theory.
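
The unit equivalence quoted above (1 kcal per mole against 1/23 eV per
atom) can be verified with a short calculation. This is a minimal Python
sketch using the exact SI values of the Avogadro constant and the
elementary charge and the thermochemical calorie; the function name is an
illustrative assumption.

```python
AVOGADRO = 6.02214076e23             # 1/mol (exact SI value)
ELEMENTARY_CHARGE = 1.602176634e-19  # J per eV (exact SI value)
JOULES_PER_KCAL = 4184.0             # thermochemical kilocalorie


def kcal_per_mol_to_ev(energy_kcal_per_mol):
    """Convert an energy per mole (kcal/mol) to an energy per particle (eV)."""
    joules_per_particle = energy_kcal_per_mol * JOULES_PER_KCAL / AVOGADRO
    return joules_per_particle / ELEMENTARY_CHARGE


ev_per_atom = kcal_per_mol_to_ev(1.0)
print(ev_per_atom, 1.0 / ev_per_atom)  # the second number is close to 23
```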

Finally, to use a statistical theory such as CVM, the excess energy has to
be composed of short-ranged multi-site correlation energies, including
only the single-site, pair, etc., up to a manageable cluster size containing
only a handful of sites. To circumvent these difficulties, Connolly and
Williams (1983), working on metal alloys, proposed that these multi-site
correlation energies be deduced from ordered systems that are composed
of the same atoms. This scheme allows a direct application of the
first-principles theory in the calculation of the energy parameters. This
theory has been extended to semiconductor alloys, including considerable
refinements, by Wei et al. (1990) and Ferreira et al. (1989) (to be referred
to as WF), and respectable results have been obtained for the phase
diagrams and alloy equilibrium properties. Their main results are
summarized below.
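
The Connolly-Williams idea of deducing multi-site correlation energies from
ordered structures amounts to inverting a small linear system. The
following Python sketch illustrates this under stated assumptions: the five
fcc-based ordered structures A, A3B (L1_2), AB (L1_0), AB3 (L1_2) and B are
used with clusters up to the nearest-neighbor tetrahedron, the spin
convention is +1 for A and -1 for B, the energies are hypothetical numbers
in arbitrary units, and the volume dependence used by WF is ignored. The
function names are illustrative.

```python
from fractions import Fraction as F

# Rows: ordered structures; columns: correlation functions of the
# (empty, point, pair, triangle, tetrahedron) clusters, with A = +1, B = -1.
XI = [
    [1, 1,        1,        1,        1],   # pure A
    [1, F(1, 2),  0,        F(-1, 2), -1],  # A3B (L1_2)
    [1, 0,        F(-1, 3), 0,        1],   # AB  (L1_0)
    [1, F(-1, 2), 0,        F(1, 2),  -1],  # AB3 (L1_2)
    [1, -1,       1,        -1,       1],   # pure B
]


def solve_linear(matrix, rhs):
    """Solve matrix * unknowns = rhs exactly by Gauss-Jordan elimination."""
    n = len(rhs)
    aug = [[F(v) for v in row] + [F(b)] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        pivot_value = aug[col][col]
        aug[col] = [v / pivot_value for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [vr - factor * vc for vr, vc in zip(aug[r], aug[col])]
    return [row[n] for row in aug]


def random_alloy_energy(interactions, x_b):
    """Energy of a random alloy: a k-site correlation function equals s**k,
    where s = 1 - 2*x_b is the average spin."""
    s = 1 - 2 * x_b
    return sum(j_k * s**k for k, j_k in enumerate(interactions))


# Hypothetical excess energies of the five ordered structures.
ordered_energies = [0, 12, 20, 15, 0]
J = solve_linear(XI, ordered_energies)
print(J)
print(random_alloy_energy(J, F(1, 2)))
```

Because the interactions are fitted to exactly as many structures as there
are parameters, they reproduce the input energies by construction; how well
they interpolate to other configurations depends on the structures chosen.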

The cluster excess energies in WF are calculated as a function of relative
cell volume. Figure 4.11.1 shows an example for Ga4AsnSb4-n clusters.
These energy curves are similar to the x-dependent excess energies shown
in Fig. 4.9.1, because the alloy cell volume was scaled linearly with x. The
overall implications of these excess energies are also very much in accord
with what we learned from Sec. 4.9. The magnitudes of mixing enthalpies
are generally dominated by terms driven by lattice constant mismatches.
For the lattice-matched alloys, the excess energies are very small; their
effects fall within calculational uncertainties and may therefore be
neglected. For lattice-mismatched systems, ΔE is positive and, as we have
seen in Sec. 4.9, is dominated by a temperature-independent strain
contribution. The reduced excess energies that govern the atomic
statistical correlation are small and negative. This implies
anticlustering, as shown in Figure 4.11.2 for the deviation of cluster
populations from random-alloy values. These curves are qualitatively the
same as those obtained in Sec. 4.9. The mixing enthalpy parameters Ω for
random alloys (at T → ∞) were calculated and are listed in Table 4.6.2,
where they can be compared with previous estimates. The miscibility
gaps and spinodals are also calculated and are shown in Figure 4.11.3.
These curves display considerable asymmetry about x = 1/2, a behavior
also found from the experimental data for GaSb1-xAsx. The calculated
critical temperatures, indicated as TMG, and the concentrations xMG where
the binodal curves peak are tabulated in Table 4.11.1. For systems for
which experimental data are available, the agreement between theory and
experiment is good. Finally, the predicted bimodal bond length
distributions are similar to those predicted from the 16-bond cluster
model in Sec. 4.9. However, we should note that in order to produce the
kind of accuracy in the energies required for the phase diagram
calculation, very detailed and careful computational efforts are needed.
The results presented here represent the state of the art in LDA
calculations.

Figure 4.11.1 Volume-dependent excess energies for Ga4AsnSb4-n
clusters obtained by Ferreira et al. (1989).

Figure 4.11.2 Deviations of cluster populations from random
distributions for five III-V pseudobinary alloys calculated by Wei et
al. (1990).


Table 4.11.1

Calculated critical temperatures TMG for the binodal and spinodal curves
and the corresponding compositions xMG compared to experiments.

System          TMG      xMG
Al1-xGaxAs        64     0.49
...              961     0.676
...              933     0.62
...              630     0.77
...              277     0.603
...             1080     0.595
Hg1-xCdxTe        84     0.60
...              455     0.56
...              605     0.623

After Wei et al. (1990)


However, the Connolly-Williams approach is only a kind of interpolation
between ab-initio energies of the ordered systems and alloys. The
multi-site correlation energies deduced from this approach represent
interpolation parameters for the total energies. They depend on the
ordered systems chosen in the parametrization. The volume-dependent
cluster energies are unlikely to be sufficient for calculating alloy
properties sensitive to the cluster shapes, e.g., shear moduli and lattice
vibrations. These are fine points that require improvements. A different
way to approach the ab-initio calculation is to attack the disordered
problem directly. If the fluctuations of the alloy potentials from their
virtual crystal approximation (VCA) values are small, then the next leading
correction to VCA can be obtained from perturbation theory. This should
work for most semiconductor alloys except for systems with large
potential fluctuations such as Hg1-xCdxTe (Spicer et al., 1982). A more
general but more difficult approach is to extend an effective modern alloy
theory, the coherent-potential approximation (CPA) (to be discussed in
Chapter 5), to clusters and to achieve triple self-consistency: consistency
between the cluster distribution and the Hamiltonian, between the
Hamiltonian and the electron density, and between the self-energy
operator Σ in the cluster CPA theory and the potential fluctuations.
Currently this theory has been carried out only for metal alloys, and only
within the single-site KKR-CPA (Gyorffy and Stocks, 1979) with a random
distribution. There is also a theory developed along with CPA, the
so-called generalized perturbation method (GPM) (Ducastelle and Gautier,
1976), which is intended to systematically calculate multi-site
correlation energies from the CPA results. However, to use GPM in
semiconductors, it needs to be extended to include total energy
contributions from terms other than the band structure term in
Eq. (2.2.13) and to allow lattice distortions. Major work is needed if this
approach is to achieve the same degree of rigor for disordered alloys as
self-consistent density functional theory now enjoys in dealing with
crystalline semiconductors.
Abstract. We review the current status of knowledge of fundamental
properties of the alloy Hg1-xCdxTe. The most vexing questions are about its
correlation state. Several different experiments now suggest it is highly
correlated, but no theory predicts this result. We also discuss other
properties, including dislocations at interfaces, the residual donor,
worms, surface segregation and its impact on passivation, and
concentration fluctuations. The forces driving these phenomena, where they
are known, will be presented. Most of the paper focuses on the following:
correlations; native defects, formation enthalpies and entropies; native
defect equilibria with mercury gas and with tellurium inclusions; and
self-diffusion coefficient activation energies including their contribution
from migration energies. We will take advantage of new first-principles,
high-accuracy calculations to help explain the experimental situation. The
calculations predict that the main native defects found in alloys
equilibrated at low Hg pressures are Hg vacancies, while at high Hg
pressures they are Hg interstitials, and, surprisingly, Hg antisites.


1.  Introduction 

As we have been told in this conference [1], sophisticated
focal plane arrays (FPAs) of sizes as large as 256 x 256
elements have been demonstrated on Hg1-xCdxTe materials
grown epitaxially by several means on various
substrates. Control of the material and its processing is
reaching a stage where practical yields of arrays with
adequate performance for some applications are nearly
at hand, so these array costs are becoming acceptable.
While the community is now well into a manufacturing
phase, there remain many unanswered questions about
the nature of Hg1-xCdxTe alloys, questions whose resolution
would improve the performance, yield and stability
of focal plane arrays. The purpose of this paper is to
review the current status of our knowledge of this alloy
system [2, 3], with the aim of suggesting potential
mechanisms for the uncertain properties and additional
experiments to test these hypotheses. We also will demonstrate
the merit of recently developed first-principles
computational methods [4] to help settle outstanding
problems associated with the material, which will eventually
lead to accurate processing and performance
models. Figure 1 depicts schematically the major issues
that will be addressed. They were selected because of
their potential impact on performance of FPAs. The figure
shows a passivated HgCdTe sample on a substrate. On
the right is a graph of a vertical composition profile
through one region of the sample, and on the left is a
blow-up of a 'microcluster'. The major topics identified
are point defects [2], correlations [3], 'worms' [5], misfit
dislocations [6, 7], substrates, extended defects [1],
inclusions [7], bulk concentration fluctuations [8] and
surface segregation [9]. Other topics such as metallization
[10], particularly to p-type material, could have also
been included in this list, but we limit our discussion to
those mentioned above. We begin by summarizing below
the most important aspects of each of these topics. A
selected set will be greatly expanded upon in later
sections of the paper.

1.1.  Point  defects 

Most Hg1-xCdxTe alloys are Te rich as-grown [1, 2].
The excess Te is accommodated by Hg vacancies (designated
V_Hg) or as Te solid inclusions (Te_s) [7, 11]. The Hg
vacancies are shallow acceptors (possibly negative-U-centre
double acceptors) [12]. With the exception of
some MBE or MOCVD material, as-grown material is p-type,
both at room temperature and at 77 K [13, 14].
This material can usually be annealed at low temperature
(~250 °C) in an Hg overpressure to fill the V_Hg sites and
to dissolve the Te inclusions [7]. When this is done the
material converts to n-type with carrier concentrations
that are remarkably low, from a few times 10** cm^-3 to a
few times 10** cm^-3 in high-purity material [14]. The n
dopant(s) (called the residual donors) are unknown.
There is evidence [14] that the principal residual donor is
an impurity rather than a native defect, but because it is
so pervasive the evidence is not completely convincing.
The residual donor concentration is unknown, uncontrolled,
and sets FPA limits for most operating modes.
Identification and control of the residual donor would
undoubtedly be beneficial. It has been demonstrated by
diffusion measurements that samples annealed in higher
Hg pressures contain Hg interstitials (Hg_I) [15, 16], but
while they are likely to be donors, the electrical state of
an Hg_I has never been directly identified. It has been
speculated that Te interstitials (Te_I) and antisites (Te_Hg)
[17] are active recombination centres, but direct evidence
on their deep states is not available. The Hg
antisite (Hg_Te) has not been previously suggested as a
high-concentration defect, but we will present arguments
in a later section that it may be present in concentrations
sufficient to affect devices [2].

Figure 1. Schematic features of HgCdTe alloys.

So  far  we  have  mentioned  only  native  defects,  but 
impurity  doping  is  now  being  brought  under  control  in 
HgCdTe  alloys.  It  has  proved  to  be  difficult  to  gain 
control of both donors and acceptors, but methods are
now  in  hand.  Well  behaved  impurities  are  In  [18]  on 
cation  sites  serving  as  donors,  and  As  on  Te  sites  serving 
as  acceptors.  They  are  inserted  into  the  material  during 
epitaxial  growth.  Ion  implantation  and  diffusion  doping 
have  not  been  successful  enough  to  be  used  in  any 
manufacturing  process.  Partial  explanations  for  these 
observations  have  been  published,  but  refinements  are 
needed  to  make  them  quantitative.  Many  devices  are  still 
made utilizing V_Hg acceptors and residual donors [19].

1.2. Correlations

It is now well recognized that semiconductor alloys of the
form A1-xBxC are rarely truly random. The A and B
atoms do not occupy their sublattice in the zinc blende
structure at random but are somewhat correlated, so
there is a degree of short-range order, and in some cases
even long-range order [20, 21]. The principal driving
terms for correlations are bond length and chemical
differences between the constituents AC and BC. In
Hg1-xCdxTe the bond lengths of HgTe and CdTe are
nearly equal, and the differences in their chemical terms
are also small, so that correlations are predicted to be
small. However, a high degree of correlation has been
deduced using five different experimental techniques (Te
nuclear magnetic resonance (NMR) [22], Raman spectroscopy
[23, 24], infrared (IR) reflection spectroscopy
[25] and x-ray diffraction [26]). In the concentration
range near x = 0.25, the correlations are such that the
material is tending toward an ordered structure rather
than spinodal decomposition. Those findings may have
important consequences for devices, in particular if the
correlation state varies spatially. Such correlation fluctuations
may be a source of spatial variability of material
properties that occurs even if the composition is uniform,
with the variation affecting the uniformity of the bandgap,
native defect and impurity concentrations, mobilities,
etc. The magnitudes of such variations may differ
greatly from one property to another. For example, one
would expect relatively slight bandgap variations, but
rather more substantial differences in vacancy concentrations
and diffusion coefficients between regions of different
correlation states. This occurs because band
structures are properties that depend on site occupations
averaged over many lattice spacings, but vacancy formation
energies depend sensitively on the local atom
arrangement around the vacant site. Both the experimental
and theoretical understanding of the correlation
state of HgCdTe need to be clarified. A more detailed
exposition of this problem is presented in section 2.
1.3. 'Worms'

Material grown by LPE and MOVPE and annealed to n
type often exhibits low carrier concentrations (~10** cm^-3)
but with electron mobilities far below those of bulk
or MBE-grown materials. If these materials are intentionally
impurity doped to be slightly more n type, their
mobilities return to the high values obtained by other
growth methods. The currently accepted, although unconfirmed,
explanation for this phenomenon is that after
the Hg overpressure anneal, there are 'woodworm-like'
domains in the n-type material that remain p type [5].
These regions produce p-n junctions, and mobilities
measured in a Hall measurement exhibit a mixed effective
mobility that appears low. Impurity doping converts
these p-type worms to n type, so while there is still a
spatial variation of the doping concentrations, all the
material is n type and the measured Hall mobility is higher.
From the viewpoint of device limits, such impurity
doping has two potentially deleterious effects. First, the
minimum carrier concentration that can be used is
increased, and second, there is a spatial variation of the
carrier concentration.

The worms are an appealing explanation for the
observed transport properties, but if the goal is to
improve devices made from LPE material then we must
understand why the worms form and how to eliminate
them during growth, or in subsequent processing. Do the
worms form because they correspond to regions of low
residual donors, or to regions where the V_Hg formation
energies are small so that they do not anneal completely?
Are the worms regions where the correlation state differs
from the remaining material? Why are worms more
prevalent in LPE or MOVPE material than in
higher-growth-temperature bulk material or
lower-growth-temperature MBE material? These and other
questions are still unanswered.

1.4. Misfit dislocations

Substantial misfit dislocation densities are always found
at the interface between Hg1-xCdxTe and substrate
materials, e.g. Cd1-yZnyTe, even when the two materials
are perfectly lattice matched [6, 7]. This situation also
occurs to a lesser but still significant extent at heterojunctions.
The reasons for these anomalous dislocation
densities remain to be determined. Because dislocations
have been demonstrated to have a deleterious impact on
minority carrier lifetimes, their effect is of particular
concern at heterojunctions where they degrade R0A
products [27-29].

1.5. Substrates

This topic has been extensively reviewed in other conference
papers [1]. The ideal substrate material would be a
large-area, IR-transparent insulator that is matched to
the active layer by lattice constant, thermal expansion
coefficient and chemical potential, and on which signal
processing circuitry could be integrated. This idealization
does not exist for Hg1-xCdxTe, so compromises must be
made. The materials CdZnTe, and GaAs or Si with buffer
layers, are the current leading contenders.

1.6. Extended defects

A variety of extended defects (grain boundaries, antiphase
boundaries, twin planes, threading dislocations,
striations, etc.) have all been directly observed in
transmission electron microscope (TEM) studies or their existence
deduced from indirect evidence. Their impact on device
characteristics is not well established, but is certainly
never helpful. Because large-area FPAs are being made,
extended defects obviously are relatively dilute or ineffective
in current device-grade material. However, improvements
are still possible.

1.7. Inclusions

We have already mentioned Te inclusions, and they will
be analysed more extensively in later sections of this
paper. They always have an array of dislocations around
them [7], are thought to serve as getters for some
impurities, and, as we suggest later, are likely to have a
non-equilibrium atmosphere of ... and Te_Hg around
them [2]. When they are dissolved in the Hg anneal,
some of their associated dislocations climb to surfaces
[7] and are eliminated, although some remain; the
gettered impurities may remain as inclusions, and the
larger Te_s inclusions may not be completely dissolved.
Means to avoid Te_s inclusion formation in as-grown
material are most desirable.

1.8. Bulk concentration fluctuations

Even if an alloy is random, any segment in space has a
Bernoulli probability concentration distribution [30].
Thus one expects small-scale concentration fluctuations
if there is nothing to suppress them. Because of the near
lattice match between HgTe and CdTe, a region of space
with a concentration that differs significantly from the
average produces no long-range strain field and is therefore
not strain suppressed. The same effect has been
found in the lattice-matched alloy Al1-xGaxAs
[31]. However, in a lattice-mismatched material like
Hg1-xZnxTe, such short-range concentration fluctuations
are suppressed. One consequence of this is that in
HgCdTe the exciton line is very broad, while in HgZnTe
it is narrow [8]. This effect should be particularly
troublesome in VLWIR material where the bandgap is very
small and fluctuations could produce semi-metal shorting
domains.
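
The size of such unsuppressed fluctuations in a strictly random alloy can
be estimated from the binomial (repeated Bernoulli trial) distribution. The
following Python sketch is an illustration only; the function names, the
region sizes and the threshold concentration are hypothetical choices, not
values taken from the paper.

```python
import math


def count_probability(x, n_sites, n_b):
    """Probability that a region of n_sites cation sites in a random alloy of
    mean composition x contains exactly n_b atoms of the x-species."""
    return math.comb(n_sites, n_b) * x**n_b * (1.0 - x)**(n_sites - n_b)


def concentration_std(x, n_sites):
    """Standard deviation of the local concentration in such a region."""
    return math.sqrt(x * (1.0 - x) / n_sites)


def prob_concentration_at_most(x, n_sites, x_max):
    """Probability that the local concentration does not exceed x_max."""
    n_max = math.floor(x_max * n_sites + 1e-12)
    return sum(count_probability(x, n_sites, k) for k in range(n_max + 1))


mean_x = 0.2                       # hypothetical mean composition
for n_sites in (100, 10000):       # hypothetical region sizes
    print(n_sites, concentration_std(mean_x, n_sites))

# Chance that a 100-site region falls at or below a hypothetical threshold.
print(prob_concentration_at_most(mean_x, 100, 0.15))
```

The standard deviation falls only as the inverse square root of the number
of sites, so small regions can deviate appreciably from the mean
composition when nothing suppresses the fluctuation.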

1.9. Surface segregation

In general, an alloy in equilibrium will not have a
uniform concentration near an interface [9]. This is most
easily understood at a vacuum surface where there are
dangling bonds. The material can minimize its free
energy by having the alloy species that makes the weakest
bonds concentrate at the surface. Also in lattice-mismatched
alloys the low-concentration species will be
driven to the surface by 'strain release' because they
experience less strain energy at the surface than in the
bulk. In the near-lattice-matched Hg1-xCdxTe alloy,
strain release plays little part, and therefore surfaces are
Hg rich, because Hg makes weaker bonds to Te than
does Cd. Detailed predictions are that for x < 0.4 the
surface layer will have concentrations in the semi-metal
range, returning to the bulk concentration within a few
atomic layers. Thus, in equilibrium, MWIR and LWIR
HgCdTe alloys will have Hg-rich semi-metal surfaces. A
CdTe-passivated HgCdTe alloy will have a uniform
concentration up to the interface because, both in the
bulk and at the interface, cations will be surrounded by
four Te atoms, so there is no driving force for segregation.
There is also little driving force for segregation at a ZnTe
interface with HgCdTe. These conclusions are true only if
interdiffusion is prohibited across the interface.

Because there is less tendency for interdiffusion between
ZnTe and Hg1-xCdxTe than between CdTe and
the alloy, we have suggested that a few atomic layers of
ZnTe followed by a thicker CdTe layer should be a
superior passivant.

A CdS/HgCdTe interface should be Cd rich because
the CdS bond strength exceeds that of HgS by more than
the CdTe bond strength exceeds that of HgTe. A similar
argument applies to oxides, except that the lattice-constant
mismatch becomes so large that dislocations and grain
boundaries begin to exert a much larger influence on the net
result.

The rapid return of the interface concentration to its
bulk value (within a few atomic layers) is predicted to occur
only above an alloy's order-disorder transition temperature
[3, 32]. Below this temperature, long-range large-excursion
concentration oscillations should occur. Because of
its expected low order-disorder temperature, one would
tend to discount such phenomena from consideration in
HgCdTe. However, the observations of large correlation
effects make us pause. If these effects are confirmed, then
there may be comparatively long-range concentration
fluctuations adjacent to an epitaxial layer-substrate interface,
which may help to explain the anomalous misfit
dislocation densities that are observed. This suggestion
(first made by Spicer et al [10], but without a mechanism
identified) is highly speculative, and while concentration
oscillations driven by surface segregation and interface
boundary conditions are a natural consequence of a
higher than expected order-disorder transition temperature,
there is no direct evidence to support their occurrence
at a HgCdTe/CdTe interface. Relevant experiments
would be helpful.

The  remainder  of  this  paper  will  be  devoted  to  an 
exposition  of  our  current  understanding  of  correlations, 
native  defects  and  their  diffusion. 


2. Review of correlations

We [21] and others [33-35] have demonstrated that
there are always correlations in alloy semiconductors.
Here we summarize our previous work, focusing attention
on those aspects that may help us to identify what
may be missing from the interactions driving correlations
in HgCdTe. We consider an average population x_j of
five-atom anion-centred clusters including 16 bonds
(most others treat four bonds only). The populations of
five-atom clusters of the form A_{4-n_j}B_{n_j}C can be shown
to follow grand canonical ensemble statistics of the form

$$x_j = \frac{g_j \exp[(\mu_B n_j - \varepsilon_j)/kT]}
{q(\{\varepsilon_j\}, \mu_B)} \tag{1}$$

where the partition function is

$$q(\{\varepsilon_j\}, \mu_B) = \sum_{j=0}^{J} g_j
\exp[(\mu_B n_j - \varepsilon_j)/kT] \tag{2}$$

and g_j is the degeneracy of cluster j = 0, 1, ..., J (4 ≤ J ≤ 2^4),
n_j is the number of B atoms on cluster j, ε_j is the
excess energy of cluster j relative to the virtual crystal
approximation (VCA) average
$\varepsilon_{VCA} = (1 - x)\varepsilon_{AC} + x\varepsilon_{BC}$
(called Δ_j by Sher et al [21]), ε_AC and ε_BC are cluster
energies of the pure AC and BC compounds, k is the
Boltzmann constant and T is the absolute temperature.

If the clusters do not have their normal counting degeneracies $g_j^0$
split, then we have $g_j = g_j^0$, the binomial coefficient, and
$J = 4$. The chemical potential is set by insisting that the
composition $x$ is correct:

$4x = \sum_j n_j x_j.$  (3)

If  one  makes  a  transformation  to  the  reduced  excess- 
energies  representation 

(called $\Delta_j$ by Sher et al [21]) then $x_j$ becomes

$x_j = g_j \exp[(\mu_B' n_j - \Delta_j)/kT] \,/\, q(\{\Delta_j\}, \mu_B')$  (5)

where

$\mu_B' = \mu_B - (\varepsilon_J - \varepsilon_0)/J.$  (6)

Note that we have $\Delta_0 = \Delta_J = 0$, so at least two members of
the set $\{\Delta_j\}$ vanish. More importantly, only the reduced
excess energies actually drive the cluster populations. We shall see
shortly that the energies of the $\{\Delta_j\}$ set are often five to
ten times smaller than those of the $\{\varepsilon_j\}$ set. Most of
the strain contribution to the $\{\varepsilon_j\}$ set subtracts out
and contributes little to the $x_j$ values.

There is an additional effect. It can best be appreciated when stated
as a theorem: If


then $J = 4$ and

$x_j = x_j^0 \equiv g_j^0 (1 - x)^{4 - n_j} x^{n_j}$  (7)

where $x_j^0$ are the populations of a random alloy. Thus, no matter
how large the constant $\delta$ may be, as long as the energies
$\Delta_j$ vary linearly with the number of B atoms on the cluster
$n_j$ and the counting degeneracy is not split, the alloy is random.
Strain energies from bond-length differences between the constituents,
chemical energies and electron-electron interactions all have some
nonlinear variations with $n_j$. The degeneracy $g_j^0$ is split by
coherent strains, temperature gradients, or anything that establishes
preferred directions for the locations of A and B atoms on a cluster.
These splittings always establish a preference for one type of cluster
in a particular orientation [e.g. for a strain in the (111) direction,
A$_3$B with the B oriented along the (111) axis], and therefore drive
deviations of the $\{x_j\}$ set from $\{x_j^0\}$ toward compound
formation, and, if the splittings are large enough, toward long-range
order. The effect of the energies $\{\Delta_j\}$ can drive the
$\{x_j\}$ set toward either compound formation or spinodal
decomposition, depending on details.
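The statement that linear cluster energies leave the alloy random lends itself to a quick numerical check. The Python sketch below assumes populations of the form $x_j = g_j \exp[(\mu n_j - \Delta_j)/kT]/q$ with binomial degeneracies and $n_j = j$, and finds the chemical potential by bisection so that $\sum_j n_j x_j = Jx$. The function name, the bisection bracket and the sample energies are illustrative assumptions, not values taken from the paper.

```python
import math

K_B = 8.617333e-5  # Boltzmann constant in eV/K


def cluster_populations(delta, x, temperature, degeneracy=None):
    """Populations x_j of clusters carrying n_j = j B atoms (j = 0..J).

    delta       -- excess energies Delta_j in eV (illustrative input)
    x           -- alloy composition, fixes the chemical potential
    temperature -- absolute temperature in K
    degeneracy  -- g_j; defaults to the unsplit binomial coefficients
    """
    J = len(delta) - 1
    g = degeneracy or [math.comb(J, j) for j in range(J + 1)]
    kT = K_B * temperature

    def populations(mu):
        # Work with logarithms and subtract the maximum for stability.
        logw = [math.log(g[j]) + (mu * j - delta[j]) / kT for j in range(J + 1)]
        top = max(logw)
        w = [math.exp(v - top) for v in logw]
        q = sum(w)
        return [v / q for v in w]

    def mean_b_atoms(mu):
        return sum(j * p for j, p in enumerate(populations(mu)))

    # mean_b_atoms rises monotonically with mu, so bisection is sufficient.
    lo, hi = -10.0, 10.0  # assumed bracket in eV
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if mean_b_atoms(mid) < J * x:
            lo = mid
        else:
            hi = mid
    return populations(0.5 * (lo + hi))


# Linear energies (Delta_j = 0.3 eV * n_j): the result is the binomial set.
linear = cluster_populations([0.3 * j for j in range(5)], 0.3, 300.0)
# Nonlinear energies favouring mixed clusters: populations leave the random set.
nonlinear = cluster_populations([0.0, -0.01, -0.015, -0.01, 0.0], 0.5, 300.0)
```

In the linear case the slope is absorbed into the chemical potential, which is why its size does not matter; only the nonlinear part of the energies shifts the populations.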

The excess free energy of an A$_{1-x}$B$_x$C alloy can be written as

$\Delta F = \Delta E - T\Delta S$  (8)

where the excess energy is

$\Delta E = M \sum_j \varepsilon_j x_j$  (9)

with $M$ being the number of clusters. The excess entropy is

with $M_j = x_j M$ (see Sher et al [21] for the justification).

The total number of clusters $M$ is related to the number of Bravais
lattice sites $N$ in the crystal by $M = N/4$ for a 16-bond cluster
(and $M = N$ for a four-bond cluster). Equations (9) and (10) can be
rewritten for the 16-bond cluster as

$\Delta E = M[(1 - x)\varepsilon_0 + x\varepsilon_J + \sum_j \Delta_j x_j]$  (11)

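and

$\Delta S = -kN\{(1 - x)\ln(1 - x) + x\ln x + (M/N)\sum_j x_j \ln(x_j/x_j^0)\}$

$\quad\;\; = (M/T)[\sum_j \Delta_j x_j - 4x\mu_B' + kT \ln q(\{\Delta_j\}, \mu_B')].$  (12)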

Note in the first equality in equation (12) that $M/N$ is 1 for a
four-bond and 1/4 for a 16-bond cluster. In the 16-bond cluster case,
there is, as a consequence, an exact cancellation of the
$(1 - x)\ln(1 - x) + x\ln x$ term by a portion of the third term in
the bracket. This occurs physically because, in the 16-bond cluster,
all the bonds associated with a given substituted atom are in the
cluster. In the four-bond case, however, bonds from each substituted
atom contribute to four different clusters.
Several features of equations (11) and (12) are noteworthy. First,
$\varepsilon_0$ and $\varepsilon_J$ are functions of $x$ but are
temperature independent. For bond-length-mismatched alloys, they vary
with $x$ roughly as $\varepsilon_0 \approx \varepsilon_0^0 x^2$ and
$\varepsilon_J \approx \varepsilon_J^0 (1 - x)^2$, so that we have

$\Delta E \approx N[\Omega x(1 - x) + \tfrac{1}{4}\sum_j \Delta_j x_j]$  (13)

where the mixing enthalpy parameter $\Omega$ is approximately

$\Omega \approx [\varepsilon_J^0 + (\varepsilon_0^0 - \varepsilon_J^0)x]/4$  (14)

and nearly $x$ independent if $\varepsilon_0^0 \approx \varepsilon_J^0$,
as is often the case. The second term in equation (13) is usually small
compared with the first and contains all the temperature dependence.
However, when equations (5), (12) and (13) are inserted into equation
(8) to obtain $\Delta F$, a term in $T\Delta S$ exactly cancels the
second term in equation (13). Thus, it is impossible to determine the
temperature variation of $\Delta E$ from a measurement of $\Delta F$.
The expression for $\Delta F$ becomes

$\Delta F = N[\Omega x(1 - x) + x\mu_B' - \tfrac{1}{4}kT \ln q(\{\Delta_j\}, \mu_B')].$  (15)

The chemical potential $\mu_B'$ is determined from equation (3), and
both $\mu_B'$ and $q$ are generally temperature dependent.
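The cancellation described above can be verified numerically. The sketch below works per lattice site for the 16-bond cluster ($M = N/4$, $J = 4$) and rests on three assumptions that should be read as a reconstruction rather than as the authors' exact expressions: the excess energy is $\Omega x(1-x) + \frac{1}{4}\sum_j \Delta_j x_j$, the excess entropy reduces to $-\frac{1}{4}k\sum_j x_j \ln(x_j/g_j)$, and the closed form is $\Omega x(1-x) + x\mu' - \frac{1}{4}kT\ln q$. The sample values of $\Delta_j$ and $\Omega$ are hypothetical.

```python
import math

K_B = 8.617333e-5  # Boltzmann constant in eV/K


def solve_clusters(delta, x, temperature):
    """Return (populations, mu, q, g) for clusters j = 0..J with n_j = j."""
    J = len(delta) - 1
    g = [math.comb(J, j) for j in range(J + 1)]
    kT = K_B * temperature

    def weights(mu):
        return [g[j] * math.exp((mu * j - delta[j]) / kT) for j in range(J + 1)]

    lo, hi = -1.0, 1.0  # assumed bracket in eV, adequate for small Delta_j
    for _ in range(200):
        mu = 0.5 * (lo + hi)
        w = weights(mu)
        # Compare sum_j n_j w_j with J x q to enforce the composition.
        if sum(j * v for j, v in enumerate(w)) < J * x * sum(w):
            lo = mu
        else:
            hi = mu
    mu = 0.5 * (lo + hi)
    w = weights(mu)
    q = sum(w)
    return [v / q for v in w], mu, q, g


def excess_free_energy_per_site(delta, omega, x, temperature):
    """Return (Delta E - T Delta S, closed form), both per site in eV."""
    pops, mu, q, g = solve_clusters(delta, x, temperature)
    kT = K_B * temperature
    d_e = omega * x * (1 - x) + 0.25 * sum(d * p for d, p in zip(delta, pops))
    d_s = -0.25 * K_B * sum(p * math.log(p / gj) for p, gj in zip(pops, g))
    direct = d_e - temperature * d_s
    closed = omega * x * (1 - x) + x * mu - 0.25 * kT * math.log(q)
    return direct, closed


direct, closed = excess_free_energy_per_site(
    [0.0, -0.004, -0.006, -0.004, 0.0], omega=0.03, x=0.4, temperature=500.0
)
```

The two returned numbers agree to rounding error, which is the numerical counterpart of the statement that the $\sum_j \Delta_j x_j$ term drops out of the free energy.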

We have calculated $\{\varepsilon_j\}$, $\{\Delta_j\}$, $\mu_B'$,
$\{x_j - x_j^0\}$, $\Delta E(T)$ and $\Delta F(T)$ for
Hg$_{1-x}$Cd$_x$Te (figure 2) and Hg$_{1-x}$Zn$_x$Te (figure 3) alloys.
The energies $\varepsilon_j$ for these figures were calculated as
discussed in detail in [21]. The 16-bond cluster was attached to a
rigid medium at the virtual crystal positions in the third shell from
the cluster centre. The atom positions inside the third shell were
adjusted to minimize the strain and chemical energies. The energy
$\varepsilon_j$ is the total energy of the 16 bonds with the atoms in
their minimum-energy positions. The variations of atom bond lengths in
different alloys are well predicted by this method. The $\Delta F$
curves for Hg$_{1-x}$Cd$_x$Te show no tendency for an order-disorder
transition in the temperature range studied, but the $x_j - x_j^0$
values do deviate from zero. Hg$_{1-x}$Zn$_x$Te does have an
order-disorder spinodal transition with critical temperature [36]
$T_c \approx 380$ K. Although Hg$_{1-x}$Cd$_x$Te shows significant
deviations of the $x_j$ from random alloy values for a sample
equilibrated at 300 K, they are still not as large as those measured in
the NMR experiments, meaning that a major interaction may be missing.
However, at 300 K diffusion may be fast enough in HgCdTe for the
material to equilibrate even at this low temperature (see the
discussion in the next section). The slower diffusion observed in
HgZnTe may be a distinct advantage, one that counters its
above-room-temperature critical temperature.

We have shown that only the small $\{\Delta_j\}$ drive deviations in
the populations $x_j$ away from randomness. Moreover, if
$\Delta_j = \delta n_j$ is linear in $n_j$, the number of B atoms, then
despite the size of $\delta$, the populations are still random. Thus,
effects that would otherwise be considered small may compete with the
larger energies retained if they have the proper nonlinearity. We are
examining several possibilities in Hg$_{1-x}$Cd$_x$Te. These include
effects caused by screening in the composition range where the alloy is
a semi-metal, direct second-neighbour chemical interactions treated in
the context of Harrison's model [37], electron-electron Coulomb
interactions driven by polarity differences between the constituents,
and composition variations of the elastic constants.


3.  Native defects and diffusion


3.1.  Data fitting

There have been several measurements [15, 16] of the Hg vacancy
formation energy and Hg, Cd and Te tracer diffusion coefficients. These
measurements are always fitted to a functional form

$G = G_\infty(P) \exp(-E'/kT)$  (16)

where $E'$ is interpreted as the activation energy and $G_\infty(P)$ is
the infinite temperature limit of $G$. For vacancy formation, $G$
represents the vacancy concentration, and for diffusion the diffusion
coefficient. It is assumed in the fitting process that $G_\infty(P)$
can be viewed as a function only of the partial pressure $P$ above the
sample of the species under investigation. However, the pre-exponential
coefficient always has a power-law temperature contribution, i.e. a
$T^m$ multiplicative component. Because the measurements extend over a
limited temperature range, $T_1 \le T \le T_2$, usually have some
scatter, and the $T^m$ term varies more slowly than the exponential,
reasonable fits to the data are obtained. However, when $E'$ values are
quoted to more than one significant figure, and $G_\infty$ to the
proper order of magnitude, it is important to include the $T^m$ factor
in the fitting procedure. Because we do not have access to the original
data we have corrected published numbers by assuming the straight-line
fits of $\ln G$ versus $1/T$ were made through the points $T_3$ and
$T_4$ given by

$T_3^{-1} = \tfrac{1}{3}(2T_1^{-1} + T_2^{-1}), \quad T_4^{-1} = \tfrac{1}{3}(T_1^{-1} + 2T_2^{-1}).$  (17)

Then the parameters of the correct expression

$G = C(P) T^m \exp(-E/kT)$  (18)

are related to the fitted parameters by the relations

$E = E' - mkt$  (19)

and

$\frac{1}{m}\ln\frac{G_\infty}{C} = \frac{T_4 \ln T_4 - T_3 \ln T_3}{T_4 - T_3}$  (20)

where

$t = (T_3^{-1} - T_4^{-1})^{-1} \ln(T_4/T_3) \approx T_4.$  (21)

The approximate expression for $t$ in equation (21) is accurate if
$T_4 - T_3 \ll T_3$. Note that if $m$ is positive $E < E'$ and if $m$
is negative $E > E'$.
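The correction procedure can be sketched in a few lines of Python. The sketch assumes that the two fitting points sit at the one-third points of the measured $1/T$ interval, $T_3^{-1} = (2T_1^{-1} + T_2^{-1})/3$ and $T_4^{-1} = (T_1^{-1} + 2T_2^{-1})/3$; this choice is an assumption, made because it reproduces the $t$ values listed in table 1. The function name is illustrative.

```python
import math

K_B = 8.617333e-5  # Boltzmann constant in eV/K


def corrected_activation_energy(e_fit, m, t1, t2):
    """Return (E, t): the corrected activation energy in eV and t in K.

    e_fit  -- fitted Arrhenius activation energy E' in eV
    m      -- exponent of the T**m prefactor
    t1, t2 -- ends of the measured temperature range in K
    """
    inv_t3 = (2.0 / t1 + 1.0 / t2) / 3.0  # assumed one-third point
    inv_t4 = (1.0 / t1 + 2.0 / t2) / 3.0  # assumed two-thirds point
    # Since T4/T3 = inv_t3/inv_t4, t = ln(T4/T3) / (1/T3 - 1/T4).
    t = math.log(inv_t3 / inv_t4) / (inv_t3 - inv_t4)
    return e_fit - m * K_B * t, t


# Vacancy formation: E' = 2.24 eV, m = 11/2, measured between 400 K and 650 K.
e_vac, t_vac = corrected_activation_energy(2.24, 11 / 2, 400.0, 650.0)
# Interstitial diffusion: E' = 0.61 eV, m = -5/2, measured between 350 K and 500 K.
e_int, t_int = corrected_activation_energy(0.61, -5 / 2, 350.0, 500.0)
```

With these inputs the corrected energies round to 2.0 eV and 0.70 eV, close to the entries of table 1; a positive $m$ lowers the activation energy and a negative $m$ raises it.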

To illustrate the effect of these corrections, we have examined the
case of V$_{Hg}$ formation and diffusion and the interstitial Hg$_I$
diffusion in Hg$_{0.78}$Cd$_{0.22}$Te. The mass action equation for
formation of V$_{Hg}$,

HgTe ↔ V$_{Hg}$Te + Hg$_g$  (22)

where Hg represents a Hg atom on a Hg site, and Hg$_g$ represents Hg in
the vapour phase, has been analysed using standard results in
statistical mechanics. The diffusion coefficient $D$ is given by


where 

(24) 

and where $a$ is the hop distance, $\tau$ is the mean free time between
hops, $n/N$ is the fraction of lattice sites $N$ occupied by vacancies,
$\omega_0$ is the presentation frequency of an atom at the barrier
between it and an adjacent vacant site and $f_{I'}$ is the probability
that an adjacent atom reaches the intermediate position at the peak of
the barrier between it and an adjacent vacancy. (The prime is inserted
to remind us that this is not a normal interstitial because it lies
between two vacant sites.)


The quantity $f_{I'}$ is calculated roughly from the mass action
equation for the effective reaction

HgTe + V$_{Hg}$ ↔ V$_{Hg}$Te + Hg$_{I'}$  (25)

again remembering that the HgTe is adjacent to a V$_{Hg}$ in the
initial state.

Then, the assumption that the Hg$_{I'}$ in the intermediate position
does not remain long enough for its vibrational states to thermally
equilibrate leads to the power-law $m$ values quoted in table 1.

There are several noteworthy features to the information in table 1.
First, the corrections to the activation energies are 10 to 20%, which
is important if the second and third significant figures quoted are to
be taken seriously. Second, note that for both vacancy and interstitial
diffusion in the Chen [16] and in the Tang and Stevenson [15] work a
larger activation energy correlates to a larger pre-exponential factor.
This occurs because of the $T^m$ dependence and the fact that the two
experiments were conducted over different temperature intervals $T_1$
to $T_2$.


Table 1.  Corrections to experimental activation energies and pre-exponential coefficients.

Experiment                 E' (eV)  G∞ (cm² s⁻¹)            T₁ (K)  T₂ (K)  t (K)  E (eV)  C                          m

Vacancy formation
  Vydyanath [13, 14]       2.24                             400     650     495    2.01                               11/2

Vacancy diffusion
  Chen [16]                2.40     4.7 × 10^(…)            400     480     441    2.08    3.2 × 10^(−…)              17/2
  Tang and Stevenson [15]  2.10     4.87 × 10^(…) P_Hg⁻¹    350     500     412    1.80    5.8 × 10^(−…) P_Hg⁻¹ †     17/2

Interstitial diffusion
  Chen [16]                0.54     1.1 × 10^(−…) P_Hg      400     490     441    0.64    5.5 × 10^(−…) P_Hg ‡       −5/2
  Tang and Stevenson [15]  0.61     5.5 × 10^(−…)           350     500     412    0.70    2.4 × 10^(−…) P_Hg ‡       −5/2

† cm² s⁻¹ K^(−17/2).
‡ cm² s⁻¹ K^(5/2).

The corrected numbers for the two groups are closer together than the
uncorrected ones in both cases, though still not in perfect agreement.
If we average the corrected Hg diffusion activation energies of the two
experimental groups we get

$E_{D,V_{Hg}} = 1.94 \pm 0.2$ eV

$E_{D,Hg_I} = 0.67 \pm 0.04$ eV.

3.2.  Native defects

Much of the following discussion has appeared previously in [2]. As
mentioned earlier, the doubly ionized cation vacancy is believed to be
the dominant native defect in HgCdTe that dictates the electrical
behaviour of the undoped material [13, 14]. We have recently reported
the results of calculations of native and impurity defect total
energies in HgTe, CdTe and ZnTe [18]. The energies for the formation of
various native point defects in HgTe are summarized in table 2; the
HgTe solid and the Hg in the vapour are used as reference energies†.
These calculations used the linearized muffin-tin orbital (LMTO) method
within the local density approximation (LDA) to the exchange
correlation energy. Large supercells containing one defect per cell
were repeated periodically, and from the difference in total energies
per cell, with and without the defect, the defect formation energy was
calculated. To expedite the calculations, the energies were calculated
within the atomic spheres approximation (ASA) with a small (spd) basis
set. In the ASA, an approximation to the exact density functional is
evaluated; as a result, an error is introduced which is larger than in
other LDA methods [38], and relaxation energies cannot be accurately
calculated. Thus only those differences in energies that are > 0.5 eV
here should be viewed as significant for these calculations.

An appropriate set of mass action constants for the neutral defect
reactions is also given in table 2. The notation in that table is as
follows. A$_B$ corresponds to an A species occupying a B site, where I
corresponds to an interstitial and V to a vacancy. No subscript on a
species indicates that it is occupying the correct lattice site, e.g.
Hg = Hg$_{Hg}$. Square brackets [ ] refer to concentrations. A
subscript 'g' indicates the species in the gaseous, or vapour, phase,
and $P_{Hg}$ is the Hg vapour pressure. Most of the reactions in table
2 involve the creation or destruction of one or more unit cells.
Because the resulting change in volume is accommodated at the surface,
the change in the number of unit cells will enter into the
determination of the defect equilibrium through the surface entropy.
Additionally, surface preparation and orientation will affect the
surface free energy. We have assumed for the present that such surface
effects are negligible, i.e. that the volume expansions and
contractions can occur with negligible changes in the surface
properties.

† A different reference is used here than was used in table III in [18].
An error appears in that table owing to the incorrect use of an
energy of 2£„ per unit cell rather than 4£„. The defect energies in
table I in [18], from which the energies in table III were derived, are
correct.

To complete the defect equilibrium determination correctly we must
include the equilibration of the electronic charges of the system. To
do so we must have knowledge of the dominant charge states of the
defects and their activation energies with respect to the neutral
defect. Such calculations are complicated by the fact that most ab
initio calculations of the electronic band structure of semiconductors
predict an incorrect bandgap, $E_G$, a shortcoming of the local density
approximation (LDA). Therefore, we shall focus on the neutral native
defects here, and the established or expected charge states of these
defects.

In wide-bandgap materials the equilibration of defects can be
substantially affected by the Fermi level; for example the formation
energy of a donor will decrease when the Fermi energy is near the
valence band edge, since the donor electron can drop into a vacant
state near the valence band, thereby lowering the energy by
$\sim E_G$. Because we are discussing HgCdTe with a narrow bandgap, we
expect the Fermi effects to be small, but not insignificant at high
temperatures. Because of the small conduction band effective mass, in
n-type material the filling of the conduction band states by electrons
can shift the Fermi energy significantly. Combined with the increase in
the bandgap for the high temperatures at which most defect studies are
done, the effective bandgap can be substantially larger than the usual
77 K bandgap associated with a given concentration of HgCdTe.


Table 2.  Defect reactions and formation energies.

Defect reaction                           Defect concentration                                Energy (eV)

E_{V_Hg} + HgTe → V_Hg Te + Hg_g          [V_Hg] = P_Hg⁻¹ K_{V_Hg} exp(−E_{V_Hg}/kT)          2.01†
E_{Te_Hg} + 2HgTe → Te_Hg Te + 2Hg_g      [Te_Hg] = P_Hg⁻² K_{Te_Hg} exp(−E_{Te_Hg}/kT)       4.53
E_{Te_I} + HgTe → Te_I Te + Hg_g          [Te_I] = P_Hg⁻¹ K_{Te_I} exp(−E_{Te_I}/kT)          4.96
E_{V_Te} + Hg_g → Hg V_Te                 [V_Te] = P_Hg K_{V_Te} exp(−E_{V_Te}/kT)            3.12‡
E_{Hg_Te} + 2Hg_g → Hg Hg_Te              [Hg_Te] = P_Hg² K_{Hg_Te} exp(−E_{Hg_Te}/kT)        −0.42
E_{Hg_I} + Hg_g → Hg_I                    [Hg_I] = P_Hg K_{Hg_I} exp(−E_{Hg_I}/kT)            0.84, 0.98

† Corrected experimental number from Vydyanath [13, 14].
‡ Calculated using a tight-binding Hamiltonian [17].

First we consider the defects which accommodate
excess tellurium (the first three defects in table 2). The mass action
constants are given by a product of the form

$K_j = K_j^0 \exp(-\Delta E_j/kT).$  (26)

For the first three equations the $K_j^0$ are given by

$K^0_{V_{Hg}} = C_0 (kT)^{5/2} (2\pi m_{Hg})^{3/2} h^{-3} \exp(\Delta S_{V_{Hg}}/k)$  (27)

$K^0_{Te_{Hg}} = C_0 (kT)^{5} (2\pi m_{Hg})^{3} h^{-6} \exp(\Delta S_{Te_{Hg}}/k)$  (28)
and 


$K^0_{Te_I} = C_0 (kT)^{5/2} (2\pi m_{Hg})^{3/2} h^{-3} \exp(\Delta S_{Te_I}/k).$  (29)

Here $T$ is the temperature in kelvin, $k$ is Boltzmann's constant,
$m_{Hg}$ is the mass of the mercury vapour atoms, $h$ is Planck's
constant, $\Delta S_j$ is the change in vibrational entropy upon
formation of the defect and $C_0$ converts from site fraction to volume
concentrations. Estimates, valid at high temperature, of the
temperature variation of the $\Delta S_j$ terms were included in the
pre-exponential dependence $T^m$ of the reaction constants in table 1.
Because two unit cells of HgTe are destroyed when a tellurium antisite
is created, compared with one unit cell when a mercury vacancy is
created, we do not expect that
$\exp[(\Delta S_{Te_{Hg}} - \Delta S_{V_{Hg}})/k] \approx 1$. While we
have not completed the evaluation of these entropy terms, our
preliminary estimates indicate that this ratio is ~10^(…). For the
tellurium interstitial and the mercury vacancy, we expect that
$\exp[(\Delta S_{Te_I} - \Delta S_{V_{Hg}})/k] \approx 1$ will be
correct within a factor of 10. Evaluating the numerical constants we
find


$[\mathrm{Te_{Hg}}]/[\mathrm{V_{Hg}}] \approx 10^{-10}$  (30)

$[\mathrm{Te_I}]/[\mathrm{V_{Hg}}] \approx 10^{-19}$  (31)

for $T = 500$ °C and $P_{Hg} = 1$ atm. The conclusion from
equations (30) and (31) that the mercury vacancy is the dominant native
defect is consistent with experimental observation. This conclusion is
unchanged if we include the possibility that the species may be ionized
at the equilibration temperature where the material is expected to be
intrinsic. Although the tellurium antisite density increases more
rapidly with decreasing Hg pressure than does the mercury vacancy
density, the point at which the concentrations are comparable is at
less than $10^{-10}$ atm, and certainly the HgTe phase boundary is
reached before such low Hg pressures can be achieved. This is also
consistent with the fact that no p-to-n conversion is observed in
isothermal anneals for low mercury pressures [13, 14, 38], as would be
expected if tellurium antisites became the majority native defect.
Because the pressure dependences of the tellurium interstitial and the
mercury vacancy concentrations are the same, the above conclusions will
hold independent of the mercury pressure.

We have checked the sensitivity of the calculated concentration ratios
to the magnitude of the reaction enthalpy. Because the enthalpies enter
the exponents, small changes in the enthalpies will result in large
changes in the predicted defect concentrations. For example, let us
assume that our calculated antisite formation enthalpy is in error by
0.5 eV: in this case the ratio of antisite-to-vacancy concentrations
(at 500 °C) will be reduced to ~1 × 10^(−…). For an antisite formation
enthalpy in error by 1.0 eV, this ratio is reduced to ~1 × 10^(−…). We
do not expect the ASA errors to exceed 0.5 eV [5].
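The size of this sensitivity follows directly from the Boltzmann factor. As a rough illustration, the Python sketch below computes by how many decades a concentration ratio shifts when one formation enthalpy changes by a given amount, under the assumption that nothing else in the ratio (entropy factors, pressure) changes. The function name is illustrative.

```python
import math

K_B = 8.617333e-5  # Boltzmann constant in eV/K


def ratio_shift_decades(energy_error_ev, temperature_k):
    """log10 of the factor exp(dE/kT) by which a Boltzmann-weighted
    concentration ratio changes when one enthalpy shifts by dE."""
    return energy_error_ev / (K_B * temperature_k) / math.log(10.0)


T_ANNEAL = 500.0 + 273.15  # 500 degrees C expressed in K

shift_half_ev = ratio_shift_decades(0.5, T_ANNEAL)  # about 3.3 decades
shift_one_ev = ratio_shift_decades(1.0, T_ANNEAL)   # about 6.5 decades
```

At 500 °C an enthalpy error of 0.5 eV therefore moves a ratio by roughly three orders of magnitude, and an error of 1.0 eV by between six and seven.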

If the HgCdTe is not completely annealed, and tellurium precipitates
are still present, the defect equilibrium will not be that predicted by
the mass action equations given in table 2. For example, near the
inclusions we can assume that the defects will be nearly in equilibrium
with the tellurium solid; thus

$E_{V_{Hg}}$ + Te$_s$ ↔ V$_{Hg}$Te  (32)

and

$E_{Te_{Hg}}$ + 2Te$_s$ ↔ Te$_{Hg}$Te  (33)

will be the appropriate reactions. The formation energies for a
tellurium antisite and an Hg vacancy from the tellurium solid are
calculated to be 1.63 eV and 1.15 eV, respectively. Although the
difference in the formation energies is less than when both defects are
referenced to the mercury vapour (~0.5 eV compared with ~2 eV), the gas
phase entropy factor does not enter into the ratio of the defect
concentrations. Using the same estimate of the entropy ratio, the
defect concentration ratio using tellurium solid as the reference state
is


(34)

Thus, near the inclusions we expect a higher relative concentration of
tellurium antisites, as compared with the rest of the material
equilibrated with the Hg vapour. Additionally, the absolute [V$_{Hg}$]
defect concentrations may differ substantially in the two regions of
the material. At present we expect [V$_{Hg}$] to be higher in an
'atmosphere' surrounding a Te inclusion. A better calculation of the
vibrational entropy is needed before we can predict these absolute
defect concentrations and their spatial extent. Differences in the
defect concentrations arising from different equilibration conditions
are a possible source of spatial variability of the HgCdTe material. If
the material is not fully annealed to equilibrium, for example because
of an abundance of tellurium precipitates, this history may affect
subsequent processing.

In the above we have discussed the defect energies for HgTe and applied
them directly to the small-$x$ Hg$_{1-x}$Cd$_x$Te system. Because we
are dealing with the native defects of an alloy, we expect a number of
complexities to affect the above analysis. First, the variation of the
defect formation energies for vacancies is sensitive to the alloy
environment, in particular for the vacancies of the non-substituted
species, such as tellurium in HgCdTe [39]. Even for vacancies of the
substituted species, we have found that the formation energies may vary
by several tenths of an electron volt. Because of this variation in the
formation energy, the fraction of defective sites will vary by as much
as a factor of 100 from one class of sites to the next. Consider
various classes of Hg sites in ideal HgCdTe, which can be distinguished
to first order by specifying the number of Hg and Cd atoms in the
second-neighbour shell (the four first neighbours are always
tellurium), Hg$_{12-j}$Cd$_j$, with a concentration given by $[j]$. The
total vacancy concentration is given by

$[\mathrm{V_{Hg}}] = \sum_j [j]\, P_{Hg}^{-1} \exp(-E_j/kT)$  (35)

where $E_j$ is the vacancy formation energy for the $j$th cluster. The
populations of vacancies in each class of cluster, $j$, can be expected
to differ because of differences in the cluster populations and the
formation energies. Additionally, the defect energy levels may differ
in the various classes of sites, possibly leading to different
ionization states for vacancies in different classes of sites. If the
cations in the alloy are randomly arranged, such differences may be
difficult to infer experimentally. If, however, the cations are
correlated, exhibiting short-range order, more complex behaviour may be
present. As discussed earlier, such short-range order has been found in
HgCdTe. In these cases, the contribution to the vacancy densities from
the dominant class of clusters will be increased. Because the studies
finding short-range order focus on the tellurium-centred five-atom
clusters of the form Hg$_{4-n}$Cd$_n$ rather than on cation-centred
clusters of the form Hg$_{12-j}$Cd$_j$, higher-level five-atom
cluster-cluster correlations must be known to predict the effects on
the vacancy populations.
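A minimal Python sketch of this site-class sum may help to show how the vacancies distribute over the classes. It assumes a random cation arrangement, so that the site fractions $[j]$ are binomial in the 12-atom second-neighbour shell, and a purely hypothetical linear dependence of the formation energy on $j$; neither the slope nor the function name comes from the paper. The common factor $P_{Hg}^{-1}$ cancels in the shares and is omitted.

```python
import math

K_B = 8.617333e-5  # Boltzmann constant in eV/K


def vacancy_share_by_site_class(x, temperature, e0, slope, shell=12):
    """Fraction of all Hg vacancies found on sites with j Cd second neighbours.

    x     -- Cd fraction of the alloy
    e0    -- vacancy formation energy for j = 0, in eV
    slope -- hypothetical change of the formation energy per Cd neighbour, eV
    shell -- number of cation sites in the second-neighbour shell
    """
    kT = K_B * temperature
    weights = []
    for j in range(shell + 1):
        # Random-alloy site fraction [j] (binomial distribution).
        site_fraction = math.comb(shell, j) * x**j * (1 - x) ** (shell - j)
        e_j = e0 + slope * j  # illustrative energy model, not from the paper
        weights.append(site_fraction * math.exp(-e_j / kT))
    total = sum(weights)
    return [w / total for w in weights]


# x = 0.22 at 500 degrees C, with the 2.01 eV vacancy formation energy at j = 0.
flat = vacancy_share_by_site_class(0.22, 773.15, 2.01, 0.0)
tilted = vacancy_share_by_site_class(0.22, 773.15, 2.01, 0.05)
```

With a zero slope the shares simply reproduce the site fractions; a positive slope concentrates the vacancies on Hg-rich environments. For scale, a 0.3 eV difference between two classes at 500 °C changes the per-site vacancy fraction by a factor of about 90.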

Next we examine the defects which accommodate excess Hg in the solid.
The existence region for HgCdTe is always tellurium rich, and thus the
native defects which accommodate excess tellurium are expected to
dominate. For these equations in table 2, $K^0$ is given by

$K^0_{V_{Te}} = C_0 (kT)^{-5/2} (2\pi m_{Hg})^{-3/2} h^{3} \exp(\Delta S_{V_{Te}}/k)$  (36)

$K^0_{Hg_{Te}} = C_0 (kT)^{-5} (2\pi m_{Hg})^{-3} h^{6} \exp(\Delta S_{Hg_{Te}}/k)$  (37)

and

$K^0_{Hg_I} = C_0 (kT)^{-5/2} (2\pi m_{Hg})^{-3/2} h^{3} \exp(\Delta S_{Hg_I}/k).$  (38)

If we assume the change in entropy is comparable for all three defects,
we find

(39) 

L^tcJ 

and 

(40) 

I’TeJ 

for $T = 500$ °C and $P_{Hg} = 1$ atm. From equations (39) and (40) we
see that the tellurium vacancy is a minority defect species. For the
pressure and temperature considered, the density of Hg antisites is
predicted to be comparable to the density of Hg interstitials. Because
the ratio of [Hg$_{Te}$] to [Hg$_I$] is nearly unity, any errors in the
calculation of the activation energy could push the balance toward one
side or the other. Thus we must depend on the next generation of
calculations, with the ASA removed and full relaxation included, plus a
quantitative comparison of the entropy differences between the mercury
antisite and the tellurium interstitial, to determine the dominant
defect in this class.

As mentioned above, we have shown that the tellurium vacancy formation
energy varies significantly with the alloy environment. Because the
tellurium vacancy is not expected to be a dominant defect in HgCdTe,
and the tellurium diffuses by an interstitial mechanism, we do not
expect any measurable manifestation of this variation. On the other
hand, the Hg antisite may be the dominant Hg-excess defect, and its
formation energy may vary significantly with the alloy environment. We
are currently calculating the magnitude of this variation.


3.3.  Diffusion

HgCdTe exhibits a complex tracer diffusion profile, with both a fast
and a slow branch. The fast branch is attributed to a mechanism with
vacancy and interstitial diffusion in parallel where the dominant
diffuser is determined by the pressure and temperature, while the slow
component fits a mechanism with vacancy and interstitial diffusion in
series [15]. The activation energies for the fast branch as discussed
previously are 1.94 eV and 0.67 eV for the vacancy and interstitial
mechanisms, respectively. Our calculated formation energies for the
mercury interstitials are 0.84 and 0.98 eV for the anion- and
cation-surrounded tetrahedral interstitial sites, respectively, and the
experimental formation energy for the mercury vacancy is 2.01 eV.
Comparing these energies with the experimental activation energies we
find close agreement, indicating that the migration energy
contributions to the diffusion activation energies are small for both
interstitials and vacancies.

In the vacancy diffusion case we are comparing two experimental numbers
for the diffusion and formation energies. Because the diffusion energy
(1.94 eV) is the sum of the formation and migration energy, the fact
that the vacancy formation energy (2.01 eV) is larger but should be
smaller is an indication that something is amiss, and the experiments
should be repeated with the goal of attaining higher accuracy.

There is a similar discrepancy in the interstitial diffusion case, but
now we are comparing the experimental diffusion activation energy
0.67 eV with a theoretical formation energy 0.89 eV. We know that
refinements to the theory will lower the predicted value. These
refinements need to be done before more definitive conclusions can be
made.

The difference between the two numbers obtained for anion- and
cation-surrounded interstitial sites (0.84 eV and 0.98 eV) sets a lower
bound on the interstitial migration energy. Most of the interstitial Hg
will sit on the lower-energy anion-surrounded site, and migrate through
the intermediate-energy cation-surrounded sites. Examination of the
lattice arrangement between these sites leads us to believe that the
potential profile is unlikely to have a large barrier between the two
classes of site, and, as a consequence, the interstitial migration
barrier should be quite low.

In  a  recent  experiment  on  mercury  diffusion  in  ion- 
implantation-damaged  HgCdTe,  an  activation  energy  of 
several  tenths  of  an  electron  volt  was  measured  [40].  The 
disparate  results  can  be  interpreted  as  a  measure  of  only 
the defect migration contribution to the diffusion activation
energy, since defects in excess of the equilibrium
concentration were probably formed during implantation.
It is not evident that the measured activation energy
corresponds  to  the  vacancy  or  the  interstitial  mechanism. 
The  conclusion  that  the  diffusion  activation  energies  are 
largely  defect  formation  energies,  with  the  migration 
energies  being  much  smaller,  is  in  agreement  with  the 
above  interpretation  of  the  Richter  and  Kalish  [40] 
experiment. 


4.  Conclusions 

We  have  incorporated  our  calculated  defect  energies  into 
the  mass  action  equations  for  the  neutral  defects  in 
HgCdTe.  In  agreement  with  experiment,  we  find  the 
mercury  vacancy  to  be  the  dominant  native  defect  in 
tellurium-rich  material.  We  also  find  the  mercury  antisite 
and  interstitial  defect  densities  to  be  comparable,  al¬ 
though  a  better  calculation  of  the  vibrational  entropy  is 
needed  to  confirm  this  result.  Comparing  the  defect 
formation  energies  of  Vydyanath  [13, 14]  and  our  theory 
to  the  diffusion  measurements  by  Chen  [16]  and  by  Tang 
and  Stevenson  [15],  we  find  agreement  with  their  diffu¬ 
sion  activation  energies  for  both  the  vacancy  and  the 
interstitial  mechanisms,  if  we  assume  that  the  migration 
energy  is  small  in  both  cases.  Small  migration  energies 
are  consistent  with  the  measurement  of  Richter  and 
Kalish [40] and our theory. Further work is in progress
to incorporate the defect charge states into the calculation,
and also to calculate the fully relaxed defect energies
with the full-potential LMTO method.

If  the  cation  migration  energies  are  as  small  as  we  are 
suggesting,  then  some  cation  motion  will  be  present  even 
at room temperature. This implies that the crystal can
equilibrate to its room temperature order-disorder correlation
state. Most semiconductor alloys at room temperature
are in metastable correlation states frozen to the
temperature  where  diffusion  stopped  as  they  were  cooled. 
How  this  affects  devices  remains  to  be  determined. 

We  predict  that  near  a  Te  inclusion  there  will  be  an 
'atmosphere'  of  defects  differing  from  the  one  the  bulk 
solid has in equilibrium with a Hg vapour. In this
atmosphere the ratio of the concentrations of the tellurium
antisites to Hg vacancies will be increased. The
tellurium  antisite  concentration  may  still  be  too  small  to 
affect  any  device  properties,  but  if  the  Hg  vacancy 
concentration  is  greater  near  the  inclusions  than  in  the 
bulk,  this  will  be  a  source  of  spatial  variability  in  the 
material. This is a potential source of the worms in LPE
material. We are cautious about this suggestion because
TEM studies of annealed LPE material indicate that there
are few, if any, Te inclusions remaining [7].

First-principles  theories  are  now  fast  enough  and 
accurate  enough  for  practical  use  in  process  and  perfor¬ 
mance  modelling.  An  effort  to  capture  the  results  of  many 
experiments  into  consistent  models,  and  thereby  increase 
their  reliability,  has,  as  we  have  tried  to  demonstrate  in 
this  paper,  become  a  realistic  goal. 
1.  INTRODUCTION

It is usual to divide metal alloy phases into three classes: electrochemical compounds,
size factor compounds and electron compounds.^ (The term “compound” is used rather
broadly here to include secondary solid solutions.) Electrochemical compounds are those
that form following classical rules of chemical bonding in which the coordination numbers
reflect the valencies of the pure components. Size factor compounds embody the
famous Hume-Rothery “15% rule”, and include substances such as Laves phases and
other topologically close-packed structures whose atoms are of largely differing sizes and
have therefore only limited ranges of solid solubility. An intriguing principle in the theory
of alloys is the abundance of alloys in the third category, the Hume-Rothery “electron
compounds”: there appears a strong correlation between the number of valence electrons
per unit cell, valence electron concentration or electron per atom ratio and the crystal
structure adopted by certain alloy phases, be it fcc, bcc, hcp, β-Mn, γ- or ε-brass structures.
For example, the appearance of the β-phase at the solid solubility limit of primary
α-alloys of noble metals with elements in groups 12, 13 and 14 mostly occurs at a valence
electron concentration around 1.4. In the light of modern density functional theories of
electronic structure, it seems all the more remarkable that such a simple rule can hold
true over ranges of alloys that comprise simple, noble and transition metals. However,
the phenomenological evidence is hard to dispute: both the large number of electron
compounds, and the behaviour of defect structures. For example, non-stoichiometric γ-brass
will incorporate vacancies seemingly in order to maintain a valence electron concentration
that corresponds to the electron per atom ratio of 21/13 which stabilises the complex
cubic structure. NiAl has an ordered bcc structure with an electron/atom ratio of 3/2
(transition metals are assigned a valency of zero in the Hume-Rothery theory); in compounds
containing less than 50% Ni, some of the Ni-sites remain vacant and it has been
suggested^ that this is to maintain a valence electron concentration favorable to the
formation of bcc phases. It is possible that this phenomenon has nothing to do with the
valence electron concentration. Instead it may reflect the large formation energy of an Al
antisite defect in the highly ordering Ni-Al system compared with the vacancy formation
energy. In fact we believe that the Hume-Rothery e/a rule is unlikely to be operative,
for example, in Ni-Al or Ni-Zn alloys, and that since the rule was stated many merely
coincidental observations have been attributed to it. In the case of noble metal alloys
with groups 12, 13 and 14, however, we shall show that the e/a rule can be seen as a
direct consequence of the general validity of the rigid band model, as first suspected by
Jones^ in 1937.
It has been the aim of many theoretical studies over the past 55 years to explain the
electron compounds in terms of electron theory; good and comprehensive reviews of this
effort can be found in recent works by Cottrell^ and Hafner.® A very detailed discussion
of the pseudopotential approach can be found in Heine and Weaire.^ A detailed review
of experimental data and rigid-band models has been made by Massalski and Mizutani.^
Many studies have concentrated on the archetypal system Cu-Zn, since this reveals most
of the principles of the theory, and also since this was the subject of the pioneering study
of the electron compounds^ resulting in the famous “Jones Theory of Alloying.” A glance
at the phase diagram (see figure 7a) shows that Zn, which is hcp, has a wide range
of solubility in Cu, which is fcc, while between the solubility limits 0.3 < x < 0.6
Cu_xZn_{1-x} alloys contain bcc β-brass. Thus as one adds Zn to Cu, varying the valence electron
concentration, n, continuously between one and two, the structure of the alloys changes
in the sequence fcc → bcc → hcp, the phase transitions occurring at approximately
n = 1.4 and n = 1.8 (we omit discussion of the complex cubic γ-brass phase occurring
between n = 1.6 and n = 1.8). The purpose of Jones’ theory was to explain the transition
from fcc to bcc at a valence electron concentration of 1.4.

Jones’  theory  was  based  on  the  following  three  assumptions. 

1.  The energy difference between phases is given entirely by the difference in the
bandstructure energy in the two crystal structures. In other words, given a bandstructure
for the crystalline solid (that is to say, the relationship ε(k) between wavenumber k
and electron eigenenergy) and its associated density of states g(ε), the bandstructure
energy is

    E_{\mathrm{band}} = \sum_{i\ \mathrm{occ.}} \epsilon_i = \int_{-\infty}^{\epsilon_F} \epsilon \, g(\epsilon) \, d\epsilon ,    (1)

where the sum is taken over occupied electronic states. The Fermi energy ε_F (maximum
occupied energy level) is determined by the valence electron concentration n
according to

    n = \int_{-\infty}^{\epsilon_F} g(\epsilon) \, d\epsilon .    (2)

2.  In order to compute the energy difference, one can assume energy bands which deviate
from free-electron behaviour only as they approach Bragg reflecting planes in the
Brillouin zone and only in the direction normal to these planes. Only under these
conditions could the electron dispersion ε(k) be written in closed form.

3.  The rigid band approximation. This states that the effect of alloying is to add or
subtract electrons without altering the bandstructure. In other words, we can assume
that the bandstructures of all Cu_xZn_{1-x} alloys are the same: they only differ in the
positions of their Fermi energies. A consequence of the rigid band approximation
is the implication that alloys are disordered at all compositions. This will also be
implicit in our work, so that we are not able to account for the ordering of the
β-phase into the β' (CsCl) structure. Both of these approximations can be removed
within the Coherent Potential Approximation and General Perturbation Method.^
As far as the density of states is concerned, this is not necessary in the case of
Cu-Zn, where the rigid band approximation has been shown to be valid in neutron scattering
experiments.® On the other hand, in the Cu-Ni system our approach would not be
justified.® We will also find that the rigid band model severely overestimates the heat
of formation of β-CuZn. In spite of these reservations we will keep to the rigid band
approximation so that we are able to interpret our results within the framework of Jones’
theory of alloying. Also, since the phase diagram shows no α/β equilibrium at 0 K,
our purpose here is only to expose general principles concerning the Hume-Rothery
e/a rule.
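
Equations (1) and (2) of assumption 1 can be illustrated numerically. The following Python sketch is not taken from the paper: it assumes a model free-electron density of states g(ε) = √ε in arbitrary units, integrates from ε = 0 (the model density of states vanishes below that), and uses Simpson's rule and bisection as stand-ins for a real Brillouin zone integration. All function names are hypothetical.

```python
import math

def dos_free(e, c=1.0):
    """Model free-electron density of states g(e) = c*sqrt(e), zero for e <= 0."""
    return c * math.sqrt(e) if e > 0.0 else 0.0

def integrate(f, a, b, steps=2000):
    """Composite Simpson rule on [a, b]; steps must be even."""
    if b <= a:
        return 0.0
    h = (b - a) / steps
    total = f(a) + f(b)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def electron_count(e_f, dos):
    """Equation (2): n is the integral of g(e) up to the Fermi energy."""
    return integrate(dos, 0.0, e_f)

def band_energy(e_f, dos):
    """Equation (1): E_band is the integral of e*g(e) up to the Fermi energy."""
    return integrate(lambda e: e * dos(e), 0.0, e_f)

def fermi_energy(n, dos, e_max=50.0, tol=1e-10):
    """Invert equation (2) by bisection; n(e_f) is monotonic because g >= 0."""
    lo, hi = 0.0, e_max
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if electron_count(mid, dos) < n:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

n = 1.4                                # valence electron concentration
e_f = fermi_energy(n, dos_free)        # Fermi energy from equation (2)
e_band = band_energy(e_f, dos_free)    # bandstructure energy from equation (1)
print(f"e_F = {e_f:.4f}, E_band = {e_band:.4f}")
```

For this model the results can be checked against the closed forms ε_F = (3n/2)^(2/3) and E_band = (3/5) n ε_F. In the rigid band picture of assumption 3, alloying changes only n, so the same two functions would be reused with a fixed density of states.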

Recent advances are able to address the first two assumptions more clearly than
previously. Assumption 1 is now known to be rigorously provable to first order in
density functional theory, and is known as the structural energy difference theorem.^® As
Jones pointed out,^^ if assumption 2 is not invoked, then the evaluation of the density of
states function becomes a formidable task. (Indeed even then, a further approximation
needed to be made by Jones:^ we will return to this in section 4.2.1.) This is particularly
so, since the Jones theory is centred on the nature of the density of states at and near Van Hove
singularities and these are difficult to reproduce even with modern methods of Brillouin
zone integration. Our present paper is devoted to an examination of the theory of Cu-Zn
alloys in which we apply the structural energy difference theorem to self-consistent rigid
band models. Using an advanced technique for Brillouin zone integration, we explicitly
show the Van Hove singularities responsible for the transitions from fcc to bcc to hcp
as the valence electron concentration is varied. The structure of the paper is as follows.
In section 2 we describe Jones’ seminal work, its popularisations, its relation to other
current theories and its interpretation in terms of general arguments involving typical
shapes of the density of states. In section 3 we show how two recent advances, namely
the structural energy difference theorem and the analytic quadratic method of integration,
allow us to remove all of Jones’ assumptions except the rigid band approximation for the
alloy. In section 4, we make density functional calculations of phase stability in the Cu-Zn
system using both the self-consistent bandstructure in the local density approximation,
and various model bandstructures in an attempt to recover Jones’ original theory. We
confirm the validity of the rigid band approximation in this context, and we are able to
make detailed statements about the way in which the positions of Van Hove singularities
in the bandstructure drive the transitions between phases. Section 5 contains a
summary and our conclusions.

2.  THEORIES  OF  THE  ELECTRON  COMPOUNDS 
2.1  The  Mott  and  Jones,  and  Jones  theories 

Unfortunately, considerable confusion has accumulated in the metallurgical literature
surrounding Jones’ theory of β-phase alloys. This situation arises from the fact that two
quite different explanations of the Hume-Rothery rule are attributed to Jones. We will
call these the Mott and Jones, and the Jones models and will describe them separately in
the following two paragraphs. The situation has been very clearly unravelled by
Hume-Rothery.^® We also refer the reader to discussions in Barrett and Massalski,^^ and Heine
and Weaire.®

Initially, Hume-Rothery^^ observed that in a large number of binary compounds of
univalent fcc metals, the phase diagram displayed a limit of solubility (α|(α + β) phase
boundary) at a valence electron concentration of about 1.4. Furthermore the appearance
of a single bcc phase ((α + β)|β phase boundary) occurred at a valence electron
concentration around 1.5. Mott and Jones^^ advanced the following explanation for the
phenomenon. If we imagine a free-electron Fermi sphere expanding inside the fcc Brillouin
zone as the valence electron concentration is increased between one and two, we find that
the sphere makes contact with the {111} zone faces at a valence electron concentration
of 1.36. Similarly in the bcc Brillouin zone the sphere will first contact the {011} faces
when the valence electron concentration reaches 1.48. Mott and Jones pointed out that
the density of states would go through a maximum at the point where the Fermi sphere
made contact with the zone boundaries. This is because Bragg reflection causes otherwise
free-electron-like bands to deviate and become flat. This will lead to a cusp-like peak in
the density of states. If, furthermore, we compare two structures, one of whose density
of states is a free-electron parabola, while the other has a maximum due to this zone
boundary effect, then one can argue qualitatively that when the Fermi energy lies on the
maximum, the latter structure will be stable since electrons can be accommodated under
the peak in the density of states at lower energies than in the competing structure.
Therefore as the valence electron concentration in an fcc solid solution is increased from one, at
a value of 1.36 the density of states in the stable fcc phase will begin to decrease and that
phase will become less stable than a competing structure with free-electron-like density of
states since the decreasing density of states will force electrons into higher energy states
and de-stabilise the fcc phase. This is said to explain the formation of β-phase alloys at
valence electron concentrations around 1.4. At a valence electron concentration of 1.48,
the bcc phase will be most stable; this is the explanation for the ubiquitous occurrence
of β-phase alloys at the famous Hume-Rothery electron/atom ratio of 3/2. Clearly, the
Mott and Jones model implies that at a valence electron concentration of one (that is to
say, in Cu) the Fermi surface is not in contact with any of the faces of the fcc Brillouin
zone. Measurement of the Fermi surface of Cu by Pippard in 1957 showed that this was
incorrect, and that the Fermi surface of Cu formed elongated necks at the {111} Brillouin
zone boundaries.^® This is often quoted as indicative of the complete failure of Jones’
theory. An important fallacy in the Mott and Jones theory is that if the bands were
free-electron-like so that contact with the zone boundaries was made at the free-electron
values of 1.36 and 1.48, then there would be no peaks in the density of states.
Conversely, if there were peaks (arising from deviations from free-electron behaviour) then
they would not occur at the free-electron values but at lower values of valence electron
concentration. A final consequence of the popularisation of the Mott and Jones model is
the statement encountered in several elementary texts that stability of a structure arises
when the Fermi level falls in a maximum in the density of states, and that a decreasing
density of states indicates instability with respect to a competing phase. This rather
counter-intuitive statement is indeed incorrect, as we shall show in section 2.2.
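
The free-electron contact values 1.36 and 1.48 quoted above follow from elementary geometry, as the short Python sketch below shows. It assumes a free-electron Fermi sphere with k_F³ = 3π² × (electron density) and the standard zone-face distances for the fcc and bcc Brillouin zones; the function name is illustrative.

```python
import math

def contact_electrons_per_atom(atoms_per_cube, face_distance):
    """Electrons per atom at which the free-electron Fermi sphere touches a zone face.

    face_distance is the distance from the zone centre to the face, in units of pi/a.
    With lattice constant a, the electron density is n * atoms_per_cube / a**3 and
    k_F**3 = 3 * pi**2 * density. Setting k_F = face_distance * pi / a gives
    n = pi * face_distance**3 / (3 * atoms_per_cube).
    """
    return math.pi * face_distance**3 / (3.0 * atoms_per_cube)

# fcc: 4 atoms per conventional cube; {111} face centre (L point) at sqrt(3)*pi/a.
n_fcc = contact_electrons_per_atom(4, math.sqrt(3.0))
# bcc: 2 atoms per conventional cube; {011} face centre (N point) at sqrt(2)*pi/a.
n_bcc = contact_electrons_per_atom(2, math.sqrt(2.0))

print(f"fcc {{111}} contact: {n_fcc:.2f} electrons/atom")  # 1.36
print(f"bcc {{011}} contact: {n_bcc:.2f} electrons/atom")  # 1.48
```

In closed form these are π√3/4 and π√2/3. They hold only for strictly free-electron bands, which is the point of the fallacy noted above: with a finite band gap the contact moves to lower electron concentrations.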

The above model is frequently referred to as the Jones theory of alloying, even in
modern metallurgical textbooks. But Jones’ work of 1937 is distinguished from the Mott
and Jones model in two respects. Firstly, Jones explicitly incorporated the large band
gap observed on the Bragg-reflecting {111} planes in fcc Cu; and secondly, rather than
making qualitative arguments Jones numerically integrated his model density of states to
distinguish the energies of the fcc and bcc phases. The characteristic peaked structure of
the fcc and bcc densities of states is shown in figure 1a. These were obtained by Jones from
approximate Brillouin zone integrals of a model dispersion for Cu which included a band
gap of 4.1 eV deduced from optical measurements. The effect of the large gap is to move
the peaks at the zone boundaries to valence electron concentrations much lower than the
free-electron values. In fact the Fermi level in Cu indicates that in Jones’ model the Fermi
surface was about to make contact with the zone boundaries (see figure 1a). Numerical
integrations of the densities of states put the α|(α + β) and (α + β)|β phase boundaries
at valence electron concentrations of 1.41 and 1.45, in very good agreement with what
was known of the phase diagram in 1937. These values do not, of course, correspond
with the peaks in Jones’ densities of states, which occur at approximately 1 and 1.2
electrons/atom. In fact fcc is at its most stable at a valence electron concentration
corresponding to the peak in the bcc density of states (see figure 1b). This result is
completely at odds with the conventional wisdom since at this point the fcc density
of states is rapidly decreasing; however we shall see in section 2.2 that this is in fact
quite consistent with the shapes of the densities of states. Much of the confusion in
the literature, in fact, has arisen from a mixing up of the Mott and Jones, and Jones
models. In most textbooks^'^^ we find Jones’ figure (our figure 1a) reproduced, but with
the peaks in the density of states labelled with the free-electron values 1.36 and 1.48
electrons per atom. Furthermore, it is claimed that fcc becomes unstable with respect
to bcc as soon as the fcc density of states begins to decrease. This is not Jones’ result,
as can be seen from figure 1b; however, it is not immediately obvious since the density
of states is plotted against energy whereas the structural energy differences are plotted
against valence electron concentration. It is often clearer to show both quantities plotted
against n. Then for a given bandfilling, one can immediately compare the densities of
states and the energy difference.

Heine and Weaire^ have pointed out the important distinction between theories that
attribute special stability to structures in which contact is made between a Brillouin zone
boundary and the Fermi surface and those that involve contact with the Fermi sphere,
which is the Fermi surface the alloy would have if the bands were truly free-electron like.
There is no justification in the former case, as can already be seen from Jones’ results
and will be demonstrated below. The Mott and Jones assertion that stability arises when
the Fermi sphere makes contact is phenomenologically correct, and we will see why it
approximately works in section 2.2. This is also the result that emerges from second
order perturbation theory because the leading terms in these theories are indeed response
functions of a free-electron gas which have weak singularities at twice the free-electron
Fermi wavevector. However, these singularities do not immediately lead to minima in
the bandstructure energy. Therefore even in pseudopotential perturbation theory there
is no complete explanation of the Hume-Rothery rule of electron compounds. It will be
necessary to accurately calculate energy bands and carefully sum the occupied states.
This is the approach we shall adopt here. There are two advantages over pseudopotential
perturbation theory, on which attempts to understand the Hume-Rothery rule have invariably
been based in the past.^>^ Firstly, our bandstructure energies are exact to all orders;
and secondly we will properly include sd-hybridisation in our calculations. We will show
explicitly that neglect of d-band effects leads to an incorrect prediction of the structure
of pure Cu.

2.2  General  arguments 

Following  Jones^^  and  Massalski  and  Mizutani,^  it  is  possible  to  make  some  quite 
general  arguments  in  the  context  of  which  we  will  be  able  to  make  an  analysis  of  our 
results  below.  At  the  same  time  the  shortcomings  of  the  Mott  and  Jones  theory  become 
immediately  evident. 
Given the density of states of two different structures as a function of the energy, it is
difficult to envisage the difference of the bandstructure energies as a function of the valence
electron concentration, since the interplay between two different Fermi levels must be
considered. In order to understand the behaviour of the bandstructure energy, one may
exploit the fact that the derivatives of E_band with respect to the valence electron
concentration from equations (1) and (2) are given by

    \frac{dE_{\mathrm{band}}}{dn} = \epsilon_F ,    (3)

    \frac{d^2 E_{\mathrm{band}}}{dn^2} = \frac{d\epsilon_F}{dn} = \frac{1}{g(\epsilon_F)} .    (4)

Taking the difference of eq. (3) for two different structures, we see that the derivative of the
energy difference is zero when the two Fermi energies are equal. At these values of n, the
difference ΔE_band of the two bandstructure energies is therefore extremal (except in the
special case of a saddlepoint). To determine whether the extremum is a maximum or a
minimum, we take the difference of eq. (4) between the two structures. It follows that the
more stable structure at the extremum is the one with the lower density of states at the
Fermi energy.† This suggests the following procedure for analysing the energy difference
between two phases. One should first search for those n for which the Fermi energies are
equal and the energy difference is extremal. This is easily done by varying a common
Fermi energy until the occupied parts of the two densities of states have equal areas. By
comparing the two densities of states at the Fermi energy, the more stable structure can be
determined. The critical valence electron concentrations for the crossover between phases
(where the two bandstructure energies are equal) can be estimated from the positions of
the extrema.

We are now in a position to point out the basic fallacy in the Mott and Jones theory.
The central argument of the theory is that the instability of a phase is associated with the
steep drop behind a peak in the density of states (which is correct) and that the instability
occurs immediately after states behind the peak start filling up (which is incorrect). To
see the error, consider figure 1c which shows the density of states for three typical cases.
The first, labelled “P”, is the free-electron parabola. The second, labelled “F”, shows the
typical behaviour when the free-electron bands are perturbed by a set of Bragg planes.
The third curve, “B”, is similar to curve F, but corresponds to a Bragg plane further
from the origin. For the Jones model of the Cu-Zn α → β transition, F and B would
correspond to fcc and bcc, respectively. Comparing phases F and P, it is seen that the
two Fermi energies are equal at energy ε_1, at which curve F rejoins the free-electron
parabola, as well as at all higher energies. Note that the triangular areas above and below
the free-electron parabola must be equal. The two densities of states are equal at ε_1, so
that the above criterion for deciding between a maximum or minimum does not apply.

† The converse applies, however, if ΔE_band is a minimum and greater than zero or a
maximum and less than zero. Under these circumstances the most stable structure will be
that having the highest g(ε_F). This case will not arise in the present context, but in other
situations  such  as  application  to  the  energy  ordering  in  transition  metal  trialuminides^^ 
we  have  found  exactly  this  state  of  affairs.  In  that  case  a  naive  appraisal  of  the  densities 
of  states  would  have  led  to  an  incorrect  structure  prediction. 
However, curve F has clearly moved electrons to lower energies and is more stable. In fact,
phase F has a lower bandstructure energy at all n, in disagreement with the statement
that F should become unstable as soon as the peak at point A is passed.
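
The comparison of phases F and P can be reproduced with a toy calculation. The Python sketch below is an illustration under stated assumptions, not the authors' calculation: curve P is a model parabola g(ε) = √ε, and curve F is the same parabola plus one full sine period (a peak followed by an equal-area dip), with hypothetical position, width and amplitude. The band energies are tabulated with the trapezoid rule and compared at equal electron count n.

```python
import math
from bisect import bisect_left

E0, WIDTH, AMP = 0.8, 0.6, 0.3   # hypothetical bump position, width and amplitude

def dos_parabola(e):
    """Curve P: model free-electron parabola."""
    return math.sqrt(e) if e > 0.0 else 0.0

def dos_perturbed(e):
    """Curve F: parabola plus one sine period, a peak followed by an equal-area dip."""
    g = dos_parabola(e)
    if E0 < e < E0 + WIDTH:
        g += AMP * math.sin(2.0 * math.pi * (e - E0) / WIDTH)
    return g

def tabulate(dos, e_max=3.0, steps=6000):
    """Cumulative n(e) and E_band(e) on a uniform grid (trapezoid rule)."""
    h = e_max / steps
    es, ns, ebs = [0.0], [0.0], [0.0]
    for i in range(1, steps + 1):
        a, b = (i - 1) * h, i * h
        ga, gb = dos(a), dos(b)
        es.append(b)
        ns.append(ns[-1] + 0.5 * h * (ga + gb))
        ebs.append(ebs[-1] + 0.5 * h * (a * ga + b * gb))
    return es, ns, ebs

def at_filling(table, n):
    """Linearly interpolated (Fermi energy, band energy) at electron count n."""
    es, ns, ebs = table
    j = min(max(bisect_left(ns, n), 1), len(ns) - 1)
    t = (n - ns[j - 1]) / (ns[j] - ns[j - 1])
    return es[j - 1] + t * (es[j] - es[j - 1]), ebs[j - 1] + t * (ebs[j] - ebs[j - 1])

table_p = tabulate(dos_parabola)
table_f = tabulate(dos_perturbed)
fillings = [0.05 * k for k in range(1, 61)]   # n from 0.05 to 3.0
delta_e, delta_fermi = [], []                 # F minus P, at equal n
for n in fillings:
    ef_f, eb_f = at_filling(table_f, n)
    ef_p, eb_p = at_filling(table_p, n)
    delta_e.append(eb_f - eb_p)
    delta_fermi.append(ef_f - ef_p)

print(f"max of E_F - E_P over all n: {max(delta_e):.2e}")
print(f"E_F - E_P at n = 3.0:        {delta_e[-1]:.5f}")
```

In this model the energy difference is never positive: it is zero before the peak starts to fill, decreases while the Fermi energy of F lags behind that of P, and stays constant once the two Fermi energies coincide again above the dip. For the chosen sine bump the limiting value is −AMP·WIDTH²/(2π), so F is not destabilised by filling states behind its peak.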

Turning to the more relevant case of phases F and B, one sees that equal density of
states areas and the extremal bandstructure energy difference occur at the energy labelled
ε_2. There, the density of states of phase F lies below that of B, so that phase F is more
stable. We see that phase F does in fact have a lower energy when the Fermi energy
reaches point A, as discussed by Mott and Jones, but then becomes ever more stable
with increasing valence electron concentration until point ε_2 is reached. The reason is
easy to understand. We start from the situation in which F should be the most stable
according to the Mott and Jones model, namely when the F and B Fermi energies lie
at point A and at some higher energy, respectively. Incrementing the electron number in
small steps, each extra amount of charge enters at a lower Fermi energy for F than for
B and serves to further stabilize phase F. This continues well into the range where the
F-density of states lies below the B-density of states, until finally the two Fermi energies
are equal and the maximal bandstructure energy difference is reached. Consequently, and
contrary to the Mott and Jones model, the crossover at which phase B becomes stable
relative to F must lie at an even higher valence electron concentration.

These examples have demonstrated two interesting facts. Firstly, the phase transition
does not happen as soon as states after the peak at point A are filled, but comes
considerably later. Secondly, unless the density of states of the second phase deviates
from the free-electron parabola, there will be no transition at all. Therefore the critical
valence electron concentration always depends on the shape of both densities of states,
and cannot be deduced from prominent features of the first alone. We can now come to
conclusions concerning the question of the Fermi surface or Fermi sphere touching the
Brillouin zone face. Clearly there is no special significance to the valence electron
concentration at which the Fermi surface touches; this is just point A, and all we know is
that the transition is somewhere higher up. However, it can be expected that the valence
electron concentration for which the free-electron sphere touches will be a useful guide in
many cases. This is simply because two effects tend to cancel: the peak at point A always
lies below this free-electron valence electron concentration, while the energy crossover will
lie a comparable amount above the peak. Given a sequence of phases as in the Cu-Zn
system, the critical valence electron concentrations deduced from the free-electron sphere
will probably give the correct sequence of the phases and a reasonable estimate of the
phase boundaries.

It is obvious from the common tangent construction that the position of the phase
boundaries must depend on the free energies of both phases, as emphasised in the last
paragraph. This point seems to have been neglected in the literature. In the simplest
(Mott and Jones) model, one considers the critical valence electron concentration at which
the fcc phase becomes less stable than bcc; in other words, the concentration at which
the energies of the two phases are equal. This has no special significance in determining
phase boundaries, except that the α|(α + β) and (α + β)|β boundaries must bracket this
point.
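The bracketing property can be checked numerically. The sketch below is an illustration only: the two parabolic energy curves, their coefficients and the helper `lower_hull` are hypothetical and are not taken from the calculations reported here. The common tangent is found as the one long segment of the lower convex hull of the two curves.

```python
import numpy as np

def lower_hull(x, y):
    """Indices of the lower convex hull of points sorted by increasing x."""
    hull = []
    for i in range(len(x)):
        while len(hull) >= 2:
            i0, i1 = hull[-2], hull[-1]
            cross = (x[i1] - x[i0]) * (y[i] - y[i0]) - (y[i1] - y[i0]) * (x[i] - x[i0])
            if cross <= 0:  # point i1 lies on or above the chord i0 -> i, so drop it
                hull.pop()
            else:
                break
        hull.append(i)
    return hull

# Hypothetical model energy curves (arbitrary units) versus electron concentration n.
n = np.linspace(0.8, 2.0, 2001)
e_alpha = 1.0 * (n - 1.0) ** 2
e_beta = 1.5 * (n - 1.6) ** 2 + 0.02

# Concentration at which the two energies are equal (the Mott and Jones criterion).
n_cross = n[np.argmax(e_beta < e_alpha)]

# The common tangent is the single long segment of the lower convex hull.
hull = lower_hull(n, np.minimum(e_alpha, e_beta))
gaps = np.diff(n[hull])
j = int(np.argmax(gaps))
n1, n2 = n[hull[j]], n[hull[j + 1]]  # alpha|(alpha+beta) and (alpha+beta)|beta
print(f"boundaries {n1:.3f} and {n2:.3f} bracket the crossing at {n_cross:.3f}")
```

For these model curves the two boundaries lie well to either side of the energy crossing, which by itself locates neither of them.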

3. THE DENSITY FUNCTIONAL APPROACH

3.1 Frozen potential calculations

Although Jones^ had to take it as an assumption that the structural energy difference
was just the difference in bandstructure energies, we now know that in density functional
theory there is a rigorous proof of this to first order. Suppose we want the difference in
total energy between two monatomic crystals with the same atomic volume but having
different crystal structures, I and II. Then we may make a calculation of the self-consistent
potential of structure I and the bandstructure energy, eq. (1), associated with that
potential. Then, if we have a way of rigidly transferring this potential into structure II, we
may calculate the bandstructure energy of that structure in the frozen potential. The
total energy difference between structures I and II is then given by the difference in their
bandstructure energies. This can be shown to be an exact result to first order in the
difference between the frozen potential in structure II and its self-consistent potential.^®'^®
Of all the bandstructure methods available to density functional theory, the LMTO-ASA
method^^ lends itself best to this kind of calculation, since the potential can be expressed
completely in terms of a set of four "potential parameters" for each inequivalent atom
and for each orbital type: s, p and d. The frozen potential approach can be more
accurate than a self-consistent LMTO-ASA calculation. This is because the major source of
error in LMTO-ASA arises from the spheridisation of the charge density in calculating
electrostatic terms in the total energy. In the structural energy difference theorem these
terms cancel to first order and so do not contribute any error.
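
The statement of the theorem in the paragraph above can be written compactly as follows. The notation is introduced here for illustration and is an assumption: V_I is the self-consistent potential of structure I, V_II^sc is the self-consistent potential of structure II, and E_band is the bandstructure energy of eq. (1).

```latex
\Delta E^{\mathrm{I}\to\mathrm{II}}
  \;=\; E_{\mathrm{band}}^{\mathrm{II}}[V_{\mathrm{I}}]
      - E_{\mathrm{band}}^{\mathrm{I}}[V_{\mathrm{I}}]
  \;+\; O\!\left(\delta V^{2}\right),
\qquad
\delta V \;=\; V_{\mathrm{I}}\big|_{\text{frozen in II}} - V_{\mathrm{II}}^{\mathrm{sc}} .
```

Both band energies are evaluated with the same frozen potential, so only the structure-dependent part of the calculation changes between the two terms.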

Our procedure for calculating structural energy differences is the same as that
described by Skriver.^® (See also van Schilfgaarde et al.^^) In the present paper, this method
shall be called the local density approximation-rigid band (LDA-RB) approach. We have
chosen to do all our calculations at the measured atomic volume of β-CuZn (see table 1).
We begin by making a self-consistent LMTO calculation of fcc Cu, including the "combined
correction" and augmenting up to l_max = 2, i.e., including s, p and d orbitals in the
basis. The combined correction^^ is a means of going beyond the atomic spheres approximation
(ASA) in the LMTO method. By adding extra terms in the hamiltonian and overlap
matrices the combined correction accounts for those regions of space in which the atomic
spheres overlap, or which are not included in any atomic sphere, by computing integrals
between unaugmented LMTO functions in these regions. The combined correction also
includes unaugmented basis functions from l_max + 1 to ∞ inside the atomic spheres.
Although the combined correction somewhat detracts from the simple elegance of the ASA
and structural energy difference theorem, it is found to be necessary in computing the
very subtle energy differences between fcc and bcc noble metals. Essentially our approach
is to take a trial potential within which we calculate energy bands as accurately as
possible. Our self-consistent potential parameters are shown in table 1. This potential is then
used to generate bandstructures for fcc, bcc and hcp Cu. The only differences between
the structures then appear in the structure constant and combined correction matrices.
(This is another elegant feature of the "spherical" methods such as LMTO in which the
scattering properties of the atomic potentials separate out from the structure of the lattice
in the secular equations.)

From the bandstructures, we generate densities of states, which we then integrate
up to Fermi energies corresponding to a number of electrons per atom which we vary
in increments between 11 (Cu) and 12 (Zn). In this way, we are using a rigid band
formulation of the alloy problem to investigate the relative stability of fcc, bcc and hcp
as a function of number of electrons. We plot this as two energy differences, bcc-fcc and
hcp-fcc, against valence electron concentration.
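
The rigid band filling described above can be sketched as follows. This is an illustration only, under stated assumptions: the function name, the energy grid and the free-electron-like model density of states are hypothetical and stand in for the LMTO densities of states used in the actual calculations.

```python
import numpy as np

def fermi_energy_and_band_energy(energies, dos, n_electrons):
    """Fill a fixed (rigid) density of states with n_electrons.

    Returns (fermi_energy, band_energy), where band_energy is the integral
    of e * g(e) from the bottom of the grid up to the Fermi energy.
    """
    de = np.diff(energies)
    # Cumulative trapezoidal integrals of g(e) and of e * g(e).
    n_cum = np.concatenate(([0.0], np.cumsum(0.5 * (dos[1:] + dos[:-1]) * de)))
    eg = energies * dos
    e_cum = np.concatenate(([0.0], np.cumsum(0.5 * (eg[1:] + eg[:-1]) * de)))
    # Invert n(e) for the Fermi energy, then read off the band energy there.
    e_fermi = np.interp(n_electrons, n_cum, energies)
    e_band = np.interp(e_fermi, energies, e_cum)
    return e_fermi, e_band

# Model check: g(e) = 1.5 sqrt(e) gives n = e_F^(3/2) and E_band = (3/5) n e_F.
energies = np.linspace(0.0, 2.0, 20001)
dos = 1.5 * np.sqrt(energies)
e_f, e_b = fermi_energy_and_band_energy(energies, dos, 1.0)
print(f"Fermi energy {e_f:.4f}, band energy {e_b:.4f}")
```

Calling the same function with the densities of states of two structures, for each electron count in turn, and subtracting the band energies gives a structural energy difference curve of the kind plotted against valence electron concentration.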

3.2  Reconstructing  the  density  of  states 


Because the whole problem, and particularly the interpretation, hinges on the
existence of Van Hove singularities in the bandstructure at the Brillouin zone boundaries,
we have devoted some effort to the problem of obtaining the density of states from the
bandstructure ε(k), where we evaluate the eigenvalues ε of the hamiltonian at a discrete
set of reciprocal lattice vectors {k}.


This is a perennial problem in solid state theory: namely, how to construct a density
of states function from eigenvalues given on a mesh of k-points. Solutions fall generally
into two classes: those that use only the information given at each point of the discrete set
(these are called "sampling" methods); and those which interpolate the bandstructure
within the k-mesh and then calculate the density of states analytically from the
interpolated bandstructure. Of the latter class, the most widely used is the linear tetrahedron
method of Jepsen and Andersen.^^ Here, the reciprocal unit cell is divided into tetrahedra
with apexes lying on the k-mesh. The bands are then interpolated linearly within each
tetrahedron, resulting in a continuous (but not differentiable) bandstructure everywhere
within the unit cell. For this approximate bandstructure, the density of states is
calculated analytically. A serious drawback is that the interpolation will lead to errors if there
are bands that cross within a tetrahedron. Also, neither linear nor sampling methods can
resolve Van Hove singularities properly. These are kinks, or points of infinite derivative,
in the density of states which are caused by k-points at which the gradient of the energy
band vanishes. Therefore, since the linear term vanishes at these points, one must
interpolate the bands at least to quadratic order to obtain a density of states function
with analytically correct Van Hove singularities. This can be done using the "analytic
quadratic method", in which the k-mesh is used to construct a network of tetrahedra
which have a k-point at the midpoint of each edge as well as at each apex. This allows
the eigenvalues to be interpolated to quadratic order so that the resulting bandstructure is
both continuous and once differentiable. Unfortunately this approach still leads to errors
from band crossings inside the zone. This problem is largest for the complex of the five
d-bands, which in our case are completely filled. On the other hand it is well known that
the best way to obtain integrated quantities over a full band is to use sampling over a
regular mesh.^® Therefore, in view of the very small energy differences we are interested
in, we have adopted the following strategy. To obtain the most accurate integrals over
the d-bands, we have used a sampling method over a mesh constructed by marking 40
divisions along the three reciprocal lattice basis vectors. For displaying densities of states,
and for integrals over purely s- and p-bands, we have used the analytic quadratic method
using eigenvalues from 11700, 9139 and 19635 k-points in the irreducible wedges of the
fcc, bcc and hcp Brillouin zones respectively. For easy reference in what follows, we show
in figure 2 the fcc, bcc and hcp Brillouin zones and the standard labelling of the symmetry
points and lines.
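
To make the "sampling" class of methods concrete, the sketch below builds a histogram density of states from eigenvalues on a regular mesh with 40 divisions along each reciprocal lattice vector. It is an illustration only: the single simple-cubic tight-binding band, the hopping parameter and the function name are hypothetical stand-ins for the LMTO eigenvalues, and the analytic quadratic method itself is not reproduced.

```python
import numpy as np

def sampled_dos(band_energies, bins, e_range):
    """Histogram ("sampling") estimate of the density of states, normalised to one state."""
    hist, edges = np.histogram(band_energies, bins=bins, range=e_range, density=True)
    centres = 0.5 * (edges[1:] + edges[:-1])
    return centres, hist

# Regular mesh: 40 divisions along each of the three reciprocal lattice vectors.
ndiv = 40
k = 2.0 * np.pi * np.arange(ndiv) / ndiv
kx, ky, kz = np.meshgrid(k, k, k, indexing="ij")

# Hypothetical model band: nearest-neighbour tight binding on a simple cubic lattice.
t = 1.0
eps = -2.0 * t * (np.cos(kx) + np.cos(ky) + np.cos(kz))

centres, dos = sampled_dos(eps.ravel(), bins=60, e_range=(-6.0 * t, 6.0 * t))
bin_width = centres[1] - centres[0]
print("integrated density of states:", np.sum(dos) * bin_width)
print("mean band energy:", eps.mean())
```

Integrated quantities over the full band, such as the total weight and the mean energy, are reproduced essentially exactly by the regular mesh, while the kinks that this model band has at energies of -2t and +2t are smeared over the width of a histogram bin. This is the trade-off that motivates using sampling for the filled d-bands and a higher-order interpolation where the singularities matter.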

4. RESULTS AND DISCUSSION


In  this  section,  we  show  our  solution  to  the  Cu-Zn  problem.  Firstly,  we  calculate 
structural  total-energy  differences  using  the  structural  energy  difference  theorem  described 
in  the  previous  section.  Having  demonstrated  that  the  LDA-RB  approach  gives  the 
correct  answers,  we  then  attempt  by  successive  approximations  to  recover  the  Jones,  and 
Mott  and  Jones  theories  within  our  framework. 

4.1  The  local  density  approximation-rigid  band  results 

The self-consistent LMTO bandstructures of fcc Cu and Zn are shown in figure 3
alongside the densities of states calculated using the analytic quadratic method. These
are energy bands calculated within the local density approximation to density functional
theory. We have used the parameterisation of the von Barth-Hedin exchange and
correlation energy density used by Moruzzi et al.^^ The self-consistent potential parameters are
given in table 1. To generate the bandstructures of bcc and hcp Cu the potential
parameters from self-consistent fcc Cu are used and only the structure constants and combined
correction differ, as described in section 3.1. The resulting bandstructures are shown in
figure 4, where we concentrate on the bands around the Fermi energy lying above the
d-bands. Figure 5 shows the density of states of fcc and bcc Cu superimposed. In this
figure the Van Hove singularities are labelled with symbols referring to the Bragg-reflecting
Brillouin zone boundary planes which have produced the singularities and the symmetries
of the corresponding wavefunctions. As indicated in figure 2, L and X refer to the fcc
{111} and {002} planes, while N refers to the {110} planes in the bcc zone. The
singularities are clearly seen in figure 4 as energies at which the slope of the energy bands vanishes.

Let us briefly discuss the important features in figures 3, 4 and 5. We first note that
the difference between Cu and Zn lies primarily in the position of the d-bands, which
in Zn are well below the Fermi energy, while in Cu the sp-bands around ε_F are clearly
perturbed by the d-bands, which are rather shallow. We note that the Fermi energy in Cu
lies between the bands labelled L2' and L1. L2' is a pure p state and the gap between it
and the pure s L1 state above it is caused by the requirement that the free-electron-like s
and p states be orthogonal to the tightly bound d-states.^^ The position of the L1 state is
further raised due to "repulsion" from the lowest d state, which also has L1 symmetry.^S
This is the origin of the measured gap of 4.1 eV used in Jones' calculations.^ We note that
the effect of sd-hybridisation is to produce an asymmetric band gap at the Brillouin zone
boundary. Normally in perturbation theory the effect of Bragg reflection is to split the
energy evenly about the free-electron value. (Asymmetric splitting is a common feature in
the bandstructures of noble metals.) As expected, the gap becomes smaller in the absence
of strong hybridisation with the d-band, as is seen to be the case in Zn (figure 3). In
the Mott and Jones picture, there are no d-bands, the gap at L is smaller and the Fermi
energy lies below the L2' p-state. We see that the real life case is rather far removed
from the Mott and Jones model. The origin of the raised Fermi energy is the shrinking of
the constant energy surface as sd-hybridisation is turned on, the sp-bands bending over
to join into the highest lying d-bands. This can be seen clearly in the Γ to K panel in
figure 3. The effect of sd-hybridisation is to bend the uppermost sp-band in towards Γ
so that at a given energy, the surface of constant energy shrinks. Therefore, in order to
accommodate a given number of electrons, the Fermi energy must be raised.

We now consider the structural energy differences arising from application of the
structural energy difference theorem and the rigid band approximation. In all that follows
we will use results based on the self-consistent fcc Cu potential. However, as a rather
extreme test of the rigid band approximation, we show in figure 6 the structural energy
differences computed using both the self-consistent Cu and self-consistent Zn
potentials given in table 1. Using the Cu potential certainly bears out the observed
phase transitions in the Cu-Zn phase diagram. In order to assess the validity of the
structural energy difference theorem in the atomic spheres approximation, we also show in
figure 6 the energy differences in pure Cu and Zn calculated using a full-potential LMTO
method. We believe that these results are essentially exact within the local density
approximation. It is notable that using the Cu potential one can reproduce very closely
the phase ordering in Zn. This shows that in this case the rigid band approximation is not
very severe. It is important to note that the Zn potential leads to the hcp structure being
slightly more stable in Cu. This is because the Zn potential produces relatively low-lying
d-bands, so that the sd-hybridisation effects, which we will see below are so important in
stabilising Cu in the fcc structure, are not reproduced.

It is clear that the shift of the Fermi energy to a position above the L2' singularity,
and the distortion of the free-electron-like nature of the bands due to the shallow d-bands,
mean that a different interpretation of the phase transitions in the Cu-Zn system from
the traditional one is called for. In order to consider the LDA-RB results in the light
of the general considerations of section 2.2, we show in figure 8 bandstructure energy
differences from figure 6 and the density of states and difference in Fermi energy Δε_F
plotted as functions of n. (With inclusion of the d-bands, note that the total number of
electrons per atom is n + 10.)

To begin with, we can see the origin of the stability of fcc Cu in the local density
approximation. Since the Fermi energies at n ≈ 1 are almost equal (-0.133, -0.134 and
-0.135 Ry in fcc, bcc and hcp respectively), we can conclude that the stable phase of Cu
will be that having the lowest g(ε_F). This is fcc, as seen in figures 5 and 8. In fact we can
attribute the stability of fcc with respect to bcc to the shapes of the respective Brillouin
zones. We see from figure 5 that the fcc density of states is low between the L2' and
X4' singularities, while the bcc density of states is raised near the Fermi energy by the
singularity at N1'.

At higher valence electron concentrations, the fcc density of states is raised above the
bcc by the X4' singularity, and we see in figure 8c that shortly after this singularity is
reached the bcc and fcc Fermi energies become equal, leading to a minimum in the bcc-fcc
energy difference and the stabilisation of the β-phase.

In the light of our remarks in section 2.2, we can see that the competition between fcc
and bcc structures can be analysed by identifying those valence electron concentrations
at which the Fermi energies are equal. This occurs at two energies, indicated in figure 8c
and by vertical lines in figure 5. At these points, fcc and bcc respectively are at their
most stable, the stable phase being that with the lowest density of states at the Fermi
energy.
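
The link between equal Fermi energies and extrema of the structural energy difference follows from the rigid band expressions. The short derivation below is a sketch under the assumption that the bandstructure energy has the usual form of an integral of ε g(ε) up to the Fermi energy; the superscripts bcc and fcc and the sign convention ΔE_band = E_band(bcc) - E_band(fcc) are chosen here to match the bcc-fcc curves.

```latex
n = \int^{\epsilon_F} g(\epsilon)\,d\epsilon, \qquad
E_{\mathrm{band}}(n) = \int^{\epsilon_F} \epsilon\, g(\epsilon)\,d\epsilon
\;\Longrightarrow\;
\frac{dE_{\mathrm{band}}}{dn} = \epsilon_F, \qquad
\frac{d^{2}E_{\mathrm{band}}}{dn^{2}} = \frac{1}{g(\epsilon_F)} .

\frac{d\,\Delta E_{\mathrm{band}}}{dn} = \Delta\epsilon_F
  = \epsilon_F^{\mathrm{bcc}} - \epsilon_F^{\mathrm{fcc}}, \qquad
\left.\frac{d^{2}\Delta E_{\mathrm{band}}}{dn^{2}}\right|_{\Delta\epsilon_F = 0}
  = \frac{1}{g^{\mathrm{bcc}}(\epsilon_F)} - \frac{1}{g^{\mathrm{fcc}}(\epsilon_F)} .
```

Where Δε_F = 0 the energy difference is stationary, and it is a maximum (fcc at its most stable) when the fcc density of states at the Fermi energy is the lower of the two, and a minimum (bcc at its most stable) when the bcc density of states is the lower.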

We also conclude that hcp is stabilised above n = 1.6 due to the L1 and N1 singularities
at the top of the band gap in the fcc and bcc structures. This effect is discussed in more
detail in section 4.2.1. The β-phases are stabilised by the steady linear decrease in g(ε) in
the bcc band gap, while the fcc density of states encounters a further singularity at the
{200} zone faces. Rather than the L2' singularity at the {111} faces of the Brillouin zone
being responsible for destabilisation of the fcc phase as in the Mott and Jones theory, in
the LDA-RB model this peak is washed out by the shoulder from the d-bands, and the
additional faces of the Brillouin zone at {200} are responsible for the α → β transition.
Furthermore, we have seen that features at the various Brillouin zone faces have conspired
with the steeply falling shoulders from the d-bands to favour the fcc phase in Cu. We
will see below that this result can only be obtained using realistic bandstructures (i.e.,
explicitly including the d-bands).

From the data of figure 6a, we have computed free-energy-composition curves in
order to estimate the zero-temperature phase boundaries. These curves are shown in
figure 7b, sheared according to Zener's device^O in order to allow the common tangents
to be drawn. Our α|(α + β) and (α + β)|β phase boundaries are very closely spaced
at around 1.33 electrons per atom. This construction is only useful in a comparison
with Jones' results^'^®, which we defer until section 4.2.1. The zero-temperature phase
boundaries are unknown. Indeed, it is reported^^ (see figure 7a) that β'-CuZn
decomposes eutectoidally at 250°C, so that the true limit of solubility can only be found if the
equilibrium with the γ phase is taken into account.† This has also been neglected by
Turchi et al.^, who however find a zero-temperature solubility limit at n = 1.25, which
differs from our n = 1.32. The discrepancy must arise from our use of the rigid band
approximation, which is also inappropriate for the determination of the heat of formation
of CuZn. In our LDA-RB model we obtain a heat of formation for disordered β-CuZn
of -oOkJ/g-atom. This greatly overestimates the experimental heat of formation^ ^ of
-9.4 ± 1.25 kJ/g-atom, and our calculated value of -12.9 kJ/g-atom for ordered β'-CuZn
using full-potential LMTO.^^ We have independently estimated^^ the enthalpy change
upon ordering to be of the order of -4 kJ/g-atom (which compares well with a value of
-3.okJ/g-atom given by Turchi et al.®), so that the heat of formation is properly
reproduced in the local density approximation, identifying the rigid band approximation as the
principal source of error.

4.2  Return  to  the  Simple  Theory 

We are now at the point where we have applied Jones' methods to the self-consistent
bandstructure in the local density approximation in the Cu-Zn system. We have confirmed
that indeed the observed phase transitions are closely linked to the bandstructure (as is
to be expected in the light of the structural energy difference theorem) and we have
confirmed that the LDA-RB approach predicts the correct sequence fcc → bcc →
hcp as the valence electron concentration increases between one and two. But we have
found the shallow d-band in Cu to be an important influence in the problem. Firstly, we
find that the features of the bandstructure thought to be responsible for the fcc → bcc
transition (namely the positions of the L2' and N1' Van Hove singularities) are below the
Fermi energy and largely masked in the upper shoulder of the d-electron density of states.
Secondly, we have come to suspect that the stability of fcc Cu is in fact due to the effects
of the d-band. In fact Hafner^ has pointed out that a general feature of pseudopotential
perturbation theories is that there is no ground state fcc phase at a valence electron
concentration of one. This he attributes to the neglect of sd hybridisation (see also Heine
and Weaire®).

† In which case, the common tangent being drawn between the α and γ curves, the α|(α+β)
phase boundary would move further to the left, as seen in the experimental phase diagram.
The phase diagram indicates that β'-CuZn (like NiTi) is unstable below a temperature as
high as 600°C. We have in fact found®^ that in the local density approximation β'-CuZn
has a zero value for its elastic shear constant ½(c11 - c12). In a similar calculation, NiAl is
found to have a negative value for this elastic constant. This is to be expected since the
bcc structure is unstable with respect to this shearing in the absence of covalent or ionic
bonding.®® For this reason β'-CuZn is only stabilised at elevated temperature by a large
vibrational entropy, and is not stable at 0 K.

Our next step is to make some model potentials designed to reproduce the
bandstructures of simpler theories. We consider two models in this section. In the first, we wish to
explicitly separate out the effect of the d-bands while retaining all the other features of
the bandstructure in the local density approximation. This model will represent rather
closely the Jones theory. In order to represent the Mott and Jones theory we need a
nearly free-electron model in which the effect of the d-bands is ignored and the band
gaps are smaller than those due to sd non-orthogonality and band repulsion.

4.2.1  The  Jones  Model 

We seek a model potential with which to apply the structural energy difference
theorem and which mimics as closely as possible the Jones model described in section 2. We
achieve this using a procedure in which we re-calculate the potential in the atomic sphere
in the configuration 4s4p4d rather than 4s4p3d, i.e., constraining the d-wavefunction to
have one extra node. The s and p potential parameters are therefore unchanged and the d
parameters are those of 4d rather than 3d electrons. This moves the d-electrons out of
the conduction band. Figure 9 shows the bandstructures and densities of states of fcc,
bcc and hcp Cu in the model potential. This closely reproduces the sp bands of figure 3
while leaving out the d-electrons, so that we can separate out the d-band effects. The
structural energy differences are shown and analysed in figure 10. The principal feature of
the electron compounds appears, namely the transitions to bcc and then hcp. However,
the stability of Cu at n ≈ 1 is in doubt; in fact Cu is found to be bcc, with only a small
region of fcc stability around n = 1.3.

We need to understand why, although our model and Jones' give similar looking
densities of states, the stabilities at n = 1 are different. The reason lies in the shapes
of the density of states well below the Fermi energy and in the approximation Jones
used in his ε(k) dispersion and in calculating integrals under the Fermi surface.^3 The
conventional wisdom^*^^ has it that, given two densities of states with two peaks due to
Van Hove singularities as in figures 1 and 10, at valence electron concentrations below the
peak in the density of states of the structure whose peak comes first, that structure will
be stable. This is only true if the two singularities give rise to peaks of similar height.
Both our calculations and Jones' show that the bcc Van Hove singularity is "stronger"
than the fcc (i.e., it makes a greater distortion from a free-electron density of states).
This means that the bcc density of states is raised above the free-electron density of
states by a greater amount than the fcc, and this effect persists to energies well below the
singularity. The consequence is that bcc is the stable phase over a wide range of valence
electron concentration between 0.2 and 1.1. The reason that Jones did not reveal this lies
in his approximate dispersion, which only deviated from free-electron behaviour close to
the Brillouin zone boundaries. It is also possible, as suggested by Heine and Weaire^, that
Jones' approximate method of Brillouin zone integration (the "cone approximation")^3,7 led
to errors.
The relative "strengths" of the first Van Hove singularities in the sequence hcp :
fcc : bcc seen in figure 10 arise simply from the numbers of equivalent Bragg planes
contributing; these are respectively the {11-20} prism planes in hcp, the {111} planes in
fcc and the {110} planes in bcc, of which the numbers of variants occur in the ratio 6:8:12.
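
The ratio 6:8:12 can be verified by enumerating the symmetry-equivalent plane normals. The sketch below is an illustration only: the two helper functions are introduced here, and they generate the families simply by index permutations and sign changes (full cubic symmetry, and for the hexagonal four-index notation permutations of the three basal indices with independent sign changes of the basal and axial parts).

```python
from itertools import permutations, product

def cubic_family(h, k, l):
    """Distinct plane normals equivalent to (h k l) under cubic symmetry."""
    members = set()
    for perm in permutations((h, k, l)):
        for signs in product((1, -1), repeat=3):
            members.add(tuple(s * i for s, i in zip(signs, perm)))
    return members

def hexagonal_family(h, k, i, l):
    """Distinct (h k i l) normals equivalent under hexagonal (6/mmm) symmetry."""
    members = set()
    for perm in permutations((h, k, i)):
        for s_basal, s_axial in product((1, -1), repeat=2):
            members.add(tuple(s_basal * x for x in perm) + (s_axial * l,))
    return members

n_hcp = len(hexagonal_family(1, 1, -2, 0))  # {11-20} prism planes
n_fcc = len(cubic_family(1, 1, 1))          # {111} planes
n_bcc = len(cubic_family(1, 1, 0))          # {110} planes
print(f"hcp : fcc : bcc = {n_hcp} : {n_fcc} : {n_bcc}")
```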

As in the LDA-RB model, we can apply the analysis of section 2.2 to the results
shown in figure 10. We recall that one can predict the structure of the ΔE_band curves
from the Fermi energy differences. At the points where Δε_F is zero one can predict
either a maximum or a minimum in ΔE_band depending on whether Δε_F is decreasing or
increasing. We can also make the connection between Van Hove singularities and zeros in
the Fermi energy difference as follows. At a Van Hove singularity the Fermi energy does
not increase with small increases in n. Now, we are treating fcc as the reference phase
here, so if the Van Hove singularity is in the bcc or hcp density of states then Δε_F will
decrease rapidly, and if Δε_F is initially positive this can lead to a maximum in ΔE_band
(i.e., fcc stability) if the Fermi energy difference falls to zero. If a Van Hove singularity
appears in the fcc density of states then Δε_F will rise rapidly, and if the Fermi energy is
higher in the fcc phase then Δε_F may rise through zero, leading to a minimum in ΔE_band,
i.e., instability of the fcc structure. This effect can be clearly seen in figure 10. At the
bcc and hcp Van Hove singularities Δε_F falls rapidly through zero, leading in turn to
maxima in ΔE_band. The fcc singularity at L1 causes sharp increases in both the bcc and hcp
Δε_F; only in the hcp case does this cause Δε_F to go through zero. This is the origin of
the minimum in the hcp-fcc ΔE_band curve and the stability of hcp Zn. Again, this is
associated with a low hcp density of states when the fcc and bcc densities of states rise
rapidly through the L1 and N1 Van Hove singularities.

We can now see the origin of the stability of bcc and hcp in our reproduction of Jones'
model. In the region of valence electron concentration between 1.2 and 1.7 the fcc density
of states is steadily decreasing, whereas there appear large peaks in the bcc and then hcp
density of states. These correspond to zeros in Δε_F and maxima in ΔE_band: first in the
bcc-fcc curves and then in hcp-fcc. This accounts for the successive stability of fcc and
bcc. The hcp phase is stabilised when the bcc and fcc densities of states rise sharply as
the Fermi surface encounters the Brillouin zone boundaries at N and L at the top of the
large band gap.

4.2.2  The  nearly-free-electron  (Mott  and  Jones)  model 

Finally, we would like to re-create the very simplest model, namely the
nearly-free-electron case, first proposed by Mott and Jones and usually described in textbooks as the
rationale behind the theory of the stability of β-phase alloys. We require this to be a model
without any effect of d-electrons and having small gaps at the Brillouin zone boundaries.
We can generate a bandstructure displaying these features using the potential parameters
given in table 1. The resulting bandstructure and densities of states are shown in figure 11.
The band gaps are now similar to those in a nearly-free-electron metal such as Al. The
structural energy differences and their analysis are shown in figure 12. We conclude that
the nearly-free-electron picture is not able to predict a transition to the bcc β-phase. In
fact Cu is predicted to be hcp, as is found in pseudopotential perturbation calculations,
and this is seen to be due to the free-electron sphere first making contact with a Brillouin
zone face (at M) in the hcp structure. Note that in the real life bands of figure 4, the
effect of the singularity and small band gap at M is to deplete the density of states in
the hcp phase, making hcp the least stable phase at low valence electron concentration.
In the nearly-free-electron case, because the Fermi surface at n = 1 is well inside the first
Brillouin zone in the fcc and bcc structures, the hcp phase is stable in Cu although the band
gap at M is small.

5.  SUMMARY  AND  CONCLUSIONS 

We have attempted to clear up some of the confusion surrounding the theory of
electron compounds. This has involved making self-consistent bandstructures in the local
density approximation and using these in conjunction with the structural energy difference
theorem and the rigid band approximation to address the problem from a rigorous point of
view. We have then systematically removed, firstly, the explicit appearance of the d-bands
and, secondly, the very large band gaps at the Brillouin zone faces in order to recover two
of the earlier theories.

Within the rigid-band approximation the structural energy difference theorem tells
us that the bandstructure is responsible for energy differences if the bands are generated
from frozen potentials. Then it is obvious that within this first-order approach the shapes
of various Brillouin zones must eventually be determining phase stability. This fact,
coupled with our plausibility arguments at the end of section 2.2, gives in our view a
general explanation of the Hume-Rothery electron compounds of noble metals alloying with
elements from groups 13, 14 and 15. The appearance of rather constant valence electron
concentrations at the phase boundaries in these alloys then arises as a consequence of
the approximate validity of the rigid band model and the similar features in the
bandstructures of the noble metals (in particular they all have a Fermi energy above the
$L_{2'}$ singularity).
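
The first-order argument can be written out explicitly. What follows is only a sketch in notation introduced here for illustration: the symbols $E_A$, $E_B$, $\varepsilon_F^{A}(n)$, $\varepsilon_F^{B}(n)$, $N_A$ and $N_B$ are assumptions of this sketch, not the notation of the original equations. For two structures A and B whose bands are generated from the same frozen potential and filled rigidly to a valence electron concentration n, adding dn electrons at the Fermi level costs $\varepsilon_F\,dn$, so that

```latex
\begin{align}
E_X(n) &= \int_0^{n} \varepsilon_F^{X}(n')\,dn', \qquad X = A, B, \\
\Delta E(n) &= E_A(n) - E_B(n) = \int_0^{n} \Delta\varepsilon_F(n')\,dn', \\
\frac{d\,\Delta E}{dn} &= \Delta\varepsilon_F(n)
  = \varepsilon_F^{A}(n) - \varepsilon_F^{B}(n), \\
\frac{d^{2}\Delta E}{dn^{2}} &= \frac{1}{N_A(\varepsilon_F^{A})} - \frac{1}{N_B(\varepsilon_F^{B})}.
\end{align}
```

Under these assumptions the bandstructure energy difference is extremal where the two Fermi energies coincide, and a minimum of $E_A - E_B$ requires $N_A < N_B$ at the common Fermi energy, that is, the favoured phase is the one with the smaller density of states there.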

Our  other  conclusions  are  as  follows. 

1. In the specific case of Cu alloys, the sequence of phase transitions as the valence
electron concentration varies between one and two can be attributed to features in the
electronic bandstructure. These are not the features that have previously been thought
responsible, namely the singularities arising when the Fermi surface first makes contact
with the Brillouin zone boundaries. Hybridisation with the d-bands moves the Fermi
energy well above these singularities even in pure fcc and bcc noble metals. The fcc
ground state of Cu is a direct consequence of the presence of the shallow d-band.
As shown in section 4.1, both the stability of fcc Cu and the transitions to bcc and
then hcp can be attributed to specific features arising from the shapes of the different
Brillouin zones. It is notable that the β-phase is stabilised as a result of the additional
{200} Brillouin zone planes arising in fcc, in contrast to the bcc Brillouin zone which
only has one kind of face. Stabilisation of hcp above n ≈ 1.6 arises as the Fermi energy
moves above the bcc and fcc band gaps.

2. The predictions of the Jones model are qualitatively correct, although the positions of
the phase boundaries are too sensitive to the details of the model bandstructure for this
to be a quantitative theory. Jones' successful predictions of both the experimental
phase boundaries and the fcc ground state of Cu must be regarded as fortuitous
accidents. In the former instance, the LDA-RB approach gives much worse agreement
with experimental phase boundaries than the much more approximate Jones' theory.
The fcc ground state of Cu cannot be obtained in an sp-only model, so that Jones'
result must come from approximations either in the model dispersion or in the method
of Brillouin zone integration.

3. The nearly-free-electron model is unable to predict the correct sequence of phase
transformations. Therefore, while the description of the theory of electron compounds
given in elementary texts is qualitatively simple and pleasing, it cannot be demonstrated
by an explicit calculation.

4. The popular theory attributing stability to a peak in the density of states is incorrect.

The effect of a peak in the density of states due to a Van Hove singularity is to produce
a rapid change in the difference in Fermi energies between that phase and a competing
one. This may cause $\Delta\varepsilon_F$ to go through zero, in which case the
bandstructure energy difference will be extremal: whether it is a maximum or a minimum
depends on which phase initially has the higher Fermi energy (in other words, whether
$\Delta\varepsilon_F$ is falling or rising). In the cases where the bandstructure energy
difference is extremal, and greater than zero if a maximum or less than zero if a minimum,
the stable phase will be the one with the smaller density of states at the Fermi energy;
furthermore, the two phases will have equal Fermi energies at the extremum in
bandstructure energy difference. Only under these restrictive
conditions can the densities of states at the Fermi energy be used as a measure of
structural stability.

TABLE 1: LMTO potential parameters for self-consistent Zn and Cu and for Cu in
the Jones, and Mott and Jones models (see text). C and Δ are the band centre and
width parameters; γ measures the distortion from canonical bands and 1/√p is the third
order parameter whose magnitude indicates the range of energies about the centre of the
occupied band in which the linear method is accurate. V_mtz is the muffin-tin zero and is
needed to construct the combined correction matrix. The Wigner-Seitz radius is 2.748
bohr. γ is a number; all other quantities are in Rydbergs.

                                 C         Δ         γ      1/√p     V_mtz

self-consistent   Zn   s   -0.5347    0.1442    0.4129    3.3266
                       p    0.3983    0.1371    0.1078    4.9128
                       d   -0.5868    0.0054   -0.0005    0.4899   -0.7257

self-consistent   Cu   s   -0.4280    0.1527    0.4205    3.7893
                       p    0.5143    0.1465    0.1113    5.4501
                       d   -0.3132    0.0078   -0.0031    0.5614   -0.6937

Jones model       Cu   s   -0.4280    0.1527    0.4205    3.7893
                       p    0.5143    0.1465    0.1113    5.4501
                       d    2.0776    0.1247    0.0437    4.0860

Mott & Jones      Cu   s   -0.4630    0.1324    0.4000    2.3829
model                  p    0.6251    0.1501    0.1125    5.6633   -0.6937


FIGURE  CAPTIONS 


FIGURE 1: illustrates Jones' theory. (a) The density of states of fcc and bcc Cu. The
vertical line shows the Fermi energy in fcc Cu in Jones' model. (b) The fcc-bcc energy
difference as a function of valence electron concentration calculated by Jones. Note that
the peak in the fcc density of states occurs at n ≈ 1, and in the bcc at n ≈ 1.2. (a) and
(c) after Jones. (c) Qualitative sketch of typical densities of states. Energies
$\varepsilon_1$ and $\varepsilon_2$ mark the common Fermi energy for phases F, P and F, B
respectively, for which the bandstructure energy difference is extremal.

FIGURE 2: (a) fcc, (b) bcc and (c) hcp Brillouin zones. (After Jones.)

FIGURE 3: Self-consistent local density approximation bandstructures in fcc Cu and
Zn. To the left are shown the corresponding densities of states. Note how the analytic
quadratic method clearly picks out the Van Hove singularities in the bandstructure. Here
and in the next two figures the density from the d-bands is not shown in full as it rises to
nearly 100 states/Ry/spin (see Moruzzi et al), dwarfing the structure relevant to the
present study. In this, and subsequent bandstructures, the position of the Fermi energy
is indicated by a horizontal line.

FIGURE 4: Bandstructures and densities of states in fcc, bcc and hcp Cu using the
frozen potential from self-consistent fcc Cu. Only the energy range above the d-bands is
shown here. Symmetry points are those shown in figure 2.

FIGURE 5: Densities of states of fcc and bcc Cu superimposed to show the relationships
between the important Van Hove singularities. Searching for those n at which fcc
and bcc have the same Fermi energy leads to the two values indicated by vertical broken
lines. At these points fcc and bcc respectively are at their most stable, the stable phase
being that having the lowest density of states at the Fermi energy. The differences in
densities of states can be attributed to interactions between the Fermi surface and the
Brillouin zone planes marked in the figure.

FIGURE 6: bcc-fcc and hcp-fcc bandstructure energy differences using the rigid band
structural energy difference theorem. Two frozen potentials have been used (see table 1):
(a) self-consistent Cu potential, (b) self-consistent Zn potential. Crosses are the end-point
total energy differences calculated self-consistently within the local density approximation.

FIGURE 7: (a) The Cu-Zn phase diagram. (b) Free-energy-composition curves
based on the data in figure 6(a). The curves are sheared using Zener's device: $n_{bcc}$
denotes the valence electron concentration at which the two phases joined by the common
tangent have the same energy, and the slope of the bcc curve is taken at $n = n_{bcc}$. A
common tangent shows the positions of the zero-temperature phase boundaries deduced
from the rigid band structural energy difference theorem. They cannot be interpreted
in terms of the actual phase diagram since the α/γ equilibrium is not considered here;
however, we show them to compare with Jones' results.

FIGURE 8: Analysis of the phase transitions in the rigid band structural energy
difference theorem. (a) is the same as figure 7(b); (b) density of states as a function
of valence electron concentration; (c) Fermi energy difference as a function of valence
electron concentration. The units of energy we use in this paper are Rydbergs (1 Ry =
13.6 eV, and 1 mRy/atom ≈ 1.31 kJ/g-atom).

FIGURE 9: Bandstructure and density of states using our parameters for Jones'
model (table 1).

FIGURE 10: As figure 8, but with the band parameters of the Jones model.

FIGURE 11: Bandstructure and density of states using our parameters for the Mott
and Jones model (table 1).

FIGURE 12: As figure 8, but with the band parameters of the Mott and Jones model.
Note the oscillations in the hcp density of states due to band crossings.

Appendix I

Abstract. Phase stability in the Ni-Al binary system is investigated using linear muffin-tin
orbitals total energy (LMTO) calculations. They provide total energies for the different
existing compounds and, using Connolly-Williams inversion, the many-body interactions
occurring in the FCC and BCC lattices. These interactions are used in conjunction with the
Cluster Variation Method (CVM) to calculate the phase diagram. The computed phase
diagram agrees very well with the experimental one.


1. Introduction

The modern theory of phase diagram calculations has been made possible by great
advances in band-structure calculations and in the theories of configurational
thermodynamics and phase transformations. Total energy calculations based on the local
density approximation (LDA) are now sufficiently accurate to explain many properties
of materials in terms of the underlying electronic structure [1]. An accurate calculation
of the configurational free energy of the alloy is possible within various approximations
such as mean field methods (Cluster Variation Method, CVM) or by numerical methods
(Monte Carlo simulations) [2, 3]. In these models, it is assumed that the internal energy
can be written as a sum of multi-site interactions which converges rapidly. The fact that
these interactions can be derived from first-principles calculations establishes the basis
of a comprehensive first-principles theory of cohesive, structural and thermodynamical
properties of metals and alloys. Two extreme types of approach to the calculation of
these interactions have been developed. The first one starts from the energy of the
completely disordered solid solution calculated by the coherent potential approximation
(CPA) [4]. The effective cluster interactions are calculated by the Embedded Cluster
Method [5] or by the Generalized Perturbation Method (GPM) [6] using a perturbative
treatment about the completely disordered state. In this case, the ordering energies can
be written as an expression in terms of concentration-dependent nth-order effective
cluster interactions. GPM can be developed with the first-principles multiple scattering
formalism of the Korringa-Kohn-Rostoker coherent potential approximation [7]; however,
to our knowledge, no phase diagrams have yet been provided by this approach
although, in the framework of the tight-binding approximation, interesting results have
been obtained [8-10].
The second approach is the so-called Connolly-Williams inversion [11] and the
closely related ε-G approach [12]; here, the total energy is assumed to be written as a
sum of configuration-independent many-body interaction potentials multiplied by the
multi-site correlation functions. The sum runs over all the cluster types and, in practice,
it requires the existence of a maximum cluster beyond which the many-body interactions
are supposed to be negligible. This procedure has been used successfully by several
authors [13-15] and we will adopt it here. The Ni-Al system presents both theoretical
and technological interest, the aluminides having many desirable properties such as low
density, high melting temperatures and high yield strength.

From a theoretical point of view, Sigli and Sanchez [16] have shown that a CVM
treatment using effective pair interactions determined from available thermochemical
data was able to give a good representation of the phase diagram. Several quantitative
total energy calculations [17-20] have also been performed for ordered AlNi compounds
using linearized methods [21]. All these calculations conclude that a strong hybridization
between the p states of Al and the d states of Ni leads to the formation of bonding and
antibonding states well separated by a pseudogap in the electronic density of states. The
same trend has also been found using tight-binding arguments [22]. A few calculations
have treated more subtle effects such as the relative stabilities in Ni3Al compounds [18];
van Schilfgaarde et al [23] have shown that non-self-consistent total energy calculations
can be applied to this problem: they employ the linear muffin-tin method [21] using two
different approaches. In the first approach, within the atomic sphere approximation,
they employ an analogue of the local force theorem [24, 25], in which the total energy
difference between two structures comprises the band-structure energy plus small
Coulomb corrections. In the second approach, they make no shape approximations to the
potential and employ a recasting of the local-density functional proposed by Harris and
Foulkes [26]. All these results are in good agreement with experimental determinations
of the energies of formation of Ni-Al compounds [27], which is the primary condition for
the phase diagram determination. For the second approach, i.e. calculations of the
cluster interactions, a series of results has been proposed by Carlsson, using
Connolly-Williams inversion [28, 29] with a resummation scheme which leads to
concentration-dependent effective pair interactions (EPI). The EPI so obtained are
qualitatively comparable to the ones obtained from mean field methods based on
multiple-scattering theory [30] and give very short-ranged interactions and strong ordering
tendencies. A tight-binding approach based on the Cluster Bethe Lattice Method (CBLM)
has also been used and gives EPI values very close to Carlsson's [22].

The purpose of this work is to present a complete calculation of the phase diagram
of the binary NiAl system combining the tetrahedron approximation of the CVM and the
linear muffin-tin orbital method (LMTO). This determination requires the knowledge of
SRO in solid solutions based on BCC and FCC underlying lattices, the stability of the
different compounds in the binary system and the liquid phase. The paper is organized as
follows. In section 2, we present a brief review of the quantum and statistical mechanical
approaches used in our calculations. In section 3, we present the results of our
calculations and compare them with the experimental ones.


2.  Free  energy  model 

In order to compute a phase diagram, we need to know the total free energy of the binary
alloy in a given phase, and hence its energy and its entropy of formation.

For the alloy in phase $\alpha$, the total free energy may be written as

$$F^{\alpha} = (1 - c)F_A^{\alpha} + cF_B^{\alpha} + E_f^{\alpha} - TS_f^{\alpha} \qquad (1)$$

where $c$ is the concentration of the B element, $F_I^{\alpha}$ is the free energy of pure
element I in phase $\alpha$, and $E_f^{\alpha}$ and $S_f^{\alpha}$ are respectively the
enthalpy and entropy of alloy formation.

The first simplifying assumption regarding the evaluation of this total free energy is
to assume that the free energy for the pure elements can be written as

$$F_I^{\alpha} = E_I^{\alpha} - TS_I^{\alpha} \qquad (2)$$

in which the cohesive energy $E_I^{\alpha}$ and entropy $S_I^{\alpha}$ are both temperature
independent.

The second assumption is that $S_f^{\alpha}$ is a purely configurational term, which
means that the remaining entropy depends only on the concentration via the first two
terms on the right hand side of (1).
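
As a minimal sketch of how equations (1) and (2) combine, the following Python functions evaluate the free energy of a phase from pure-element and formation quantities. The function names and argument layout are hypothetical choices made for this illustration, and consistent units are assumed for all inputs.

```python
def pure_element_free_energy(e_coh, s, temperature):
    """Equation (2): F_I = E_I - T * S_I, with E_I and S_I temperature independent."""
    return e_coh - temperature * s


def alloy_free_energy(c, temperature, pure_a, pure_b, e_form, s_form):
    """Equation (1): F = (1 - c) F_A + c F_B + E_f - T S_f.

    pure_a and pure_b are (E_I, S_I) pairs for the pure elements in the phase
    considered; e_form and s_form are the formation enthalpy and entropy.
    """
    f_a = pure_element_free_energy(pure_a[0], pure_a[1], temperature)
    f_b = pure_element_free_energy(pure_b[0], pure_b[1], temperature)
    return (1.0 - c) * f_a + c * f_b + e_form - temperature * s_form
```

In the model described here, `e_form` would come from the cluster expansion of section 2.1 and `s_form` from the configurational entropy of section 2.2.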


2.1.  Energy  of  formation 

It has been shown in the introduction that, to perform phase diagram calculations, the
internal energy of an A-B alloy is assumed to be described in terms of a rapidly
convergent series of concentration-independent multi-site interactions. More precisely,
we assume that the total energy of a particular configuration $\alpha$ is expressed by [31]

$$E^{\alpha}(r) = \sum_{\gamma} V_{\gamma}(r)\,\xi_{\gamma}^{\alpha} \qquad (3)$$

$V_{\gamma}(r)$ is the concentration-independent multi-site interaction associated with the
multi-site correlation function defined as [32]

$$\xi_{\gamma} = \frac{1}{N_{\gamma}} \sum_{\{n_1, \ldots, n_{\gamma}\}} \sigma_{n_1}\sigma_{n_2}\cdots\sigma_{n_{\gamma}}$$

where $\sigma_n$ takes the values +1 or -1 depending on the occupancy of site n,
$N_{\gamma}$ is the total number of $\gamma$-type clusters, and the sum runs over all
$\gamma$-type clusters that can be formed by combining sites on the entire crystal. The
total energies and the multi-site interactions are generally lattice-parameter r (or
volume) dependent.
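
To illustrate the correlation functions, the sketch below evaluates them for a single nearest-neighbour tetrahedron. It assumes the convention $\sigma$ = +1 for A and -1 for B (an assumption of this example), and it relies on the fact that in the L1_2 and L1_0 superstructures every nearest-neighbour tetrahedron carries the same decoration (AAAB and AABB respectively), so that averaging over one tetrahedron is enough.

```python
from fractions import Fraction
from itertools import combinations
from math import prod

# Site variables: +1 for an A atom, -1 for a B atom (convention assumed here).
SPIN = {"A": 1, "B": -1}


def tetrahedron_correlations(occupancy):
    """Average the product of site variables over all sub-clusters of each size.

    Returns [empty, point, pair, triangle, tetrahedron] as exact fractions.
    """
    spins = [SPIN[s] for s in occupancy]
    result = [Fraction(1)]  # the empty cluster always has correlation 1
    for size in range(1, len(spins) + 1):
        products = [prod(c) for c in combinations(spins, size)]
        result.append(Fraction(sum(products), len(products)))
    return result
```

For example, `tetrahedron_correlations("AAAB")` gives the point, pair, triangle and tetrahedron correlations of the A3B L1_2 structure.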

Given (3), the interactions are determined by an inversion of the sum for a finite
number of configurations defined by the existence of a maximum cluster $\gamma_{max}$
beyond which the multi-site interactions vanish.

Hence, from a finite number of total energies for ordered structures and by truncating
the summation in (3), a set of multi-site interactions is obtained from

$$V_{\gamma}(r) = \sum_{\alpha} (\xi_{\gamma}^{\alpha})^{-1} E^{\alpha}(r), \qquad \emptyset \le \gamma \le \gamma_{max}$$

$$V_{\gamma}(r) = 0, \qquad \gamma > \gamma_{max} \qquad (4)$$

where $\emptyset$ is the empty cluster.

Of course, one performs total energy calculations of as many ordered structures as
there are unknown multi-site interactions required by the truncation, but it is clear that,
in this approach, this truncation leads to a non-uniqueness of the cluster interactions
[32]. However, if the concentration-independent multi-site interactions decay rapidly,
one can expect that this difficulty becomes minor and that, in practice, the interactions
can be uniquely computed.

In a series of papers [28, 29, 33], Carlsson has computed cluster interactions for FCC
and BCC alloys of aluminium with transition metals using Connolly-Williams inversion.
For FCC calculations, he used A-FCC, B-FCC, A3B-L1_2, AB3-L1_2 and AB-L1_0 with
ideal c/a ratio. Only first-nearest-neighbour pair interactions, triangle and tetrahedron
cluster interactions are included. For BCC calculations, he takes A-BCC, B-BCC,
A3B-DO_3, AB3-DO_3, AB-B2 and AB-B32. In this case, first- and second-nearest-neighbour
pair interactions, triangle and tetrahedron cluster interactions are taken into account.
He finds that this truncation is a reasonable approximation for these systems; for the
Ni-Al system, the convergence of the cluster expansion appears to be very rapid, three- and
four-atom terms having 15% and 5% of the magnitude of the pair terms [28]. This result
is confirmed by the results for the effective pair interaction obtained via the Generalized
Perturbation Method applied to the Korringa-Kohn-Rostoker multiple scattering
formulation of the Coherent Potential Approximation (KKR-CPA), which finds roughly a
factor of ten reduction at the second-neighbour distance [30].

Carlsson has also shown that it is possible to convert multi-site interactions into
concentration-dependent effective pair interactions [28]. They are obtained by making
a truncated approximation in the higher-order correlation functions, using only the pair
correlation function. Even if this development suffers from a loss of accuracy, the
effective pair interactions present advantages in their ease of interpretability and their
practical usefulness.

Then,  using  first-principles  total  energy  calculations,  we  are  able  to  compute  the 
energies  of  formation  of  stoichiometric  compounds  occurring  in  the  studied  phase 
diagram  and  the  multisite  interactions  allowing  us  to  treat  short-range  order  in  the 
FCC-  or  BCC-based  solid  solutions  or  ordered  superstructures  presenting  an  extended 
concentration  range. 
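
The inversion for the FCC case can be sketched as follows. The example takes the five FCC reference structures listed above with the empty, point, nearest-neighbour pair, triangle and tetrahedron clusters, assumes the convention $\sigma$ = +1 on A sites and -1 on B sites, and solves the resulting 5 x 5 linear system exactly. The function names are hypothetical, and the input energies are whatever total energies one has available at a common lattice parameter; this is an illustration of the linear algebra, not the code used for the calculations reported here.

```python
from fractions import Fraction as Fr

# Correlation functions (empty, point, NN pair, triangle, tetrahedron) of the five
# FCC reference structures, with sigma = +1 on A sites and -1 on B sites (assumed).
XI = {
    "A-FCC":    [Fr(1), Fr(1),     Fr(1),     Fr(1),     Fr(1)],
    "A3B-L1_2": [Fr(1), Fr(1, 2),  Fr(0),     Fr(-1, 2), Fr(-1)],
    "AB-L1_0":  [Fr(1), Fr(0),     Fr(-1, 3), Fr(0),     Fr(1)],
    "AB3-L1_2": [Fr(1), Fr(-1, 2), Fr(0),     Fr(1, 2),  Fr(-1)],
    "B-FCC":    [Fr(1), Fr(-1),    Fr(1),     Fr(-1),    Fr(1)],
}


def solve_linear(matrix, rhs):
    """Solve matrix * x = rhs by Gauss-Jordan elimination with exact fractions."""
    n = len(rhs)
    aug = [list(row) + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        # Find a row with a non-zero entry in this column and move it into place.
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        aug[col] = [v / aug[col][col] for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [row[n] for row in aug]


def connolly_williams(energies):
    """Invert E^alpha = sum_gamma V_gamma * xi_gamma^alpha for the five V_gamma.

    energies maps each structure name in XI to its total energy.
    Returns [V_empty, V_point, V_pair, V_triangle, V_tetrahedron].
    """
    names = list(XI)
    matrix = [XI[name] for name in names]
    rhs = [Fr(energies[name]) for name in names]
    return solve_linear(matrix, rhs)
```

Because there are exactly as many structures as retained clusters, the fit reproduces the input energies exactly; the quality of the truncation can only be judged from how fast the resulting interactions decay or from additional structures not used in the inversion.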


2.2.  Configurational  entropy 

As for the energies of formation, different configurational entropy approximations
are used, depending on the nature of the phase being considered. For the strictly
stoichiometric compounds, the configurational entropy is taken equal to zero. For the
solid solutions, or ordered phases presenting an extended concentration range, the
configurational entropy is described by means of the CVM. The CVM entropy is found to
be approximately given by a sum of the partial cluster entropies [34]. The maximum
cluster used in our study is the tetrahedron containing first and second neighbours in the
BCC lattice. In the tetrahedron approximation, the entropy of a BCC disordered system
is given by [16]

$$S_{BCC} = -k_B \Big( 6\sum_{ijkl} w_{ijkl}\ln w_{ijkl} - 12\sum_{ijk} t_{ijk}\ln t_{ijk} + 3\sum_{ij} y^{(2)}_{ij}\ln y^{(2)}_{ij} + 4\sum_{ik} y^{(1)}_{ik}\ln y^{(1)}_{ik} - \sum_{i} x_i\ln x_i \Big) \qquad (5)$$

where $w_{ijkl}$, $t_{ijk}$, $y^{(2)}_{ij}$, $y^{(1)}_{ik}$ and $x_i$ denote, respectively,
the probability of finding tetrahedra, triangles, second-neighbour pairs, first-neighbour
pairs and points in the configuration given by the subscripts (i equals A or B in a binary
alloy).

For the disordered FCC structure we have

$$S_{FCC} = -k_B \Big( 2\sum_{ijkl} w_{ijkl}\ln w_{ijkl} - 6\sum_{ik} y^{(1)}_{ik}\ln y^{(1)}_{ik} + 5\sum_{i} x_i\ln x_i \Big). \qquad (6)$$

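The cluster probabilities are related by the following consistency relationships

$$t_{ijk} = \sum_{l} w_{ijkl} \qquad (7a)$$

$$y^{(2)}_{ij} = \sum_{kl} w_{ijkl} \qquad (7b)$$

$$y^{(1)}_{ik} = \sum_{jl} w_{ijkl} \qquad (7c)$$

$$x_i = \sum_{jkl} w_{ijkl} \qquad (7d)$$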
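
A short numerical check of the FCC tetrahedron entropy may help fix ideas. The sketch below assumes the standard form of the tetrahedron approximation for disordered FCC (coefficients 2, -6 and +5 for tetrahedra, nearest-neighbour pairs and points), obtains the pair and point probabilities by summing out sites of the tetrahedron probabilities in the spirit of the consistency relations, and works in units of $k_B$ per site. The function names are hypothetical. For an uncorrelated alloy the expression should reduce to the ideal mixing entropy, which the usage below verifies.

```python
import math
from itertools import product


def xlogx(p):
    """Return p * ln(p), using the limiting value 0 at p = 0."""
    return p * math.log(p) if p > 0.0 else 0.0


def fcc_tetrahedron_entropy(w):
    """Entropy per site, in units of k_B, of disordered FCC in the tetrahedron CVM.

    w maps a 4-tuple of species (i, j, k, l) to the tetrahedron probability and is
    assumed symmetric under permutations of the four sites, so that marginalising
    onto the first two sites (or the first site) gives the pair (point) probability.
    """
    pair = {}
    point = {}
    for (i, j, _k, _l), prob in w.items():
        pair[(i, j)] = pair.get((i, j), 0.0) + prob
        point[i] = point.get(i, 0.0) + prob
    s_tet = sum(xlogx(p) for p in w.values())
    s_pair = sum(xlogx(p) for p in pair.values())
    s_point = sum(xlogx(p) for p in point.values())
    return -(2.0 * s_tet - 6.0 * s_pair + 5.0 * s_point)


def random_alloy(x_b, species=("A", "B")):
    """Tetrahedron probabilities of an uncorrelated alloy with B concentration x_b."""
    x = {species[0]: 1.0 - x_b, species[1]: x_b}
    return {c: x[c[0]] * x[c[1]] * x[c[2]] * x[c[3]] for c in product(species, repeat=4)}


# Usage: the uncorrelated limit reproduces -sum_i x_i ln x_i.
ideal = -(xlogx(0.7) + xlogx(0.3))
print(fcc_tetrahedron_entropy(random_alloy(0.3)), ideal)
```

Short-range order enters through tetrahedron probabilities that deviate from the product form, which lowers the entropy below the ideal value.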

For  the  case  of  an  ordered  phase  present  in  a  range  of  concentration,  long-range  order 
is  described  in  the  usual  manner  by  means  of  sublattices  reflecting  the  symmetry  of  the 
ordered  structure.  A  given  cluster  may  now  consist  of  points  in  the  crystal  belonging  to 
different  sublattices  and  their  probabilities  have  to  be  distinguished  accordingly  (see 
[35]  for  more  details). 


2.3.  Liquid  alloys 

Liquid alloys may also display SRO, as has recently been shown for the liquid Al80Ni20
alloy [36], and it is essential to consider SRO when determining the thermodynamic data.
The best way to perform such calculations is to use a variational method with, as a
reference system, a mixture of hard spheres which all have the same diameter but different
charges and which interact through a screened Coulomb potential [37]. This reference
system has been found, coupled with pseudo-potentials or TB-CBLM, to describe well the
structural and thermodynamic manifestations of ordering in disordered alloys [38]. Very
recently, it has been shown, using a tight-binding description of the total energy of the
alloy [39], that, for the thermodynamic quantities, similar results are found for transition
metal-based alloys if the liquid configurational free energy is approximated by the
tetrahedron approximation of the CVM free energy in the disordered FCC structure. We
have kept this approximation to describe the Ni-Al liquid phase in the present work.


2.4.  Phase  equilibrium 

In order to determine the equilibrium phase diagram, it is more convenient to minimize
the grand potential $\Omega$ given by

$$\Omega = F - \mu c \qquad (8)$$

where $\mu$ is the effective chemical potential. In the present work, the minimization of
the grand potential is carried out with respect to a set of independent configurational
variables and the variable r, since effective cluster interactions are also a function of the
lattice parameter; in the tetrahedron approximation, these configurational
variables are chosen to be the tetrahedron probabilities $w_{ijkl}$ at constant temperature
T and effective chemical potential $\mu$, taking into account the normalization constraint

$$\sum_{ijkl} w_{ijkl} = 1. \qquad (9)$$

This minimization is done using the Natural Iteration (NI) method developed by
Kikuchi [40]. The NI equations used in the present model have been presented elsewhere
[16] and will not be repeated here.

The phase equilibrium between two phases I and II is computed using
the same scheme as proposed by Kikuchi and Murray [41]. For the same initial value of
the effective chemical potential $\mu$, the grand potentials of phases I ($\Omega_I$) and II
($\Omega_{II}$) are calculated using the procedure described above. If
$\Omega_I = \Omega_{II}$, the equilibrium conditions are realized; if not, the value of
$\mu$ is modified until $\Omega_I = \Omega_{II}$.
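
The search for the chemical potential at which the two grand potentials coincide can be sketched as below. This is a simplified stand-in, not the Natural Iteration scheme: it assumes a grand potential of the form F(c) - mu * c, takes each phase's free energy as a convex function of concentration only, minimises it by ternary search, and adjusts mu by bisection. All function names are hypothetical.

```python
def grand_potential(free_energy, mu, lo=0.0, hi=1.0, iterations=200):
    """Minimise F(c) - mu * c over c in [lo, hi] by ternary search (F assumed convex).

    Returns the minimum value and the concentration at which it is reached.
    """
    for _ in range(iterations):
        third = (hi - lo) / 3.0
        c1, c2 = lo + third, hi - third
        if free_energy(c1) - mu * c1 < free_energy(c2) - mu * c2:
            hi = c2
        else:
            lo = c1
    c = 0.5 * (lo + hi)
    return free_energy(c) - mu * c, c


def coexistence(f_one, f_two, mu_lo, mu_hi, tolerance=1e-10):
    """Adjust mu by bisection until the grand potentials of the two phases agree.

    Returns mu and the equilibrium concentrations of phases I and II.
    """
    def gap(mu):
        return grand_potential(f_one, mu)[0] - grand_potential(f_two, mu)[0]

    if gap(mu_lo) * gap(mu_hi) > 0.0:
        raise ValueError("mu_lo and mu_hi do not bracket a phase boundary")
    while mu_hi - mu_lo > tolerance:
        mu_mid = 0.5 * (mu_lo + mu_hi)
        if gap(mu_lo) * gap(mu_mid) <= 0.0:
            mu_hi = mu_mid
        else:
            mu_lo = mu_mid
    mu = 0.5 * (mu_lo + mu_hi)
    return mu, grand_potential(f_one, mu)[1], grand_potential(f_two, mu)[1]
```

For two parabolic free energies, F_I = 4 (c - 0.25)^2 and F_II = 4 (c - 0.75)^2 + 0.1, the common-tangent construction gives mu = 0.2 analytically, and `coexistence` reproduces this value together with the two boundary concentrations. Repeating the search at each temperature traces out a two-phase boundary.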


3. Results for the Ni-Al system

The Ni-Al phase diagram displayed in figure 1 is characterized by a liquid phase, an FCC
(A1) phase at both the Al- and Ni-rich ends, a non-congruently melting compound Al3Ni
and three intermediate phases with a variable range of solubilities, Al3Ni2, AlNi and
AlNi3. Several attempts have been made to describe the Ni-Al phase diagram. As already
mentioned, Sigli and Sanchez [16] have shown that the tetrahedron approximation of
the CVM is appropriate to describe the phase diagram of this binary system.

In that case, EPI were determined from experimental thermodynamic data and
available phase diagram information; these EPI are in good agreement with the ones
determined by Carlsson [28] and Colinet et al [22] using quantum mechanics arguments.

3.1. Ground states at zero temperature

As a first step, we have calculated the energies of formation of the four intermediate
phases observed in the equilibrium phase diagram. In the Strukturbericht notation, these
phases are called the orthorhombic DO_20 (Al3Ni) phase, the hexagonal D5_13 (Al3Ni2)
phase, the cubic B2 (AlNi) phase and the cubic L1_2 (AlNi3) phase. We have also selected
two other FCC superstructures, the L1_2 (Al3Ni) and L1_0 (AlNi) phases, and three other
BCC superstructures, the DO_3 (Al3Ni), DO_3 (AlNi3) and B32 (AlNi) phases. All these
calculations allow us to check our prediction of the correct ground states and also to
extract multi-site interactions to study the B2 (AlNi) and L1_2 (AlNi3) phases, stable over
an extended concentration range and described as SRO phases. We mention that although
the hexagonal D5_13 phase is experimentally reported to be stable over a small
concentration range, it will be described as a stoichiometric compound.

To calculate the energies of the different phases, we have employed the all-electron
total energy local density formalism as carried out with the linear muffin-tin orbital
(LMTO) method [21]. The LMTO calculations have been done in the atomic sphere
approximation, including combined corrections, as developed in the code of M van
Schilfgaarde, A T Paxton and M Methfessel (unpublished result). We have used the
parametrization of the von Barth-Hedin exchange-correlation energy density given by
Moruzzi et al [42]. In our Brillouin zone integrals, we use a uniform mesh of sampling
points with at least 16 divisions along each of the primitive vectors. With such a mesh,
we obtain a convergence of the absolute value of the total energy of 0.1 mRy. The same
radius value was taken for the Wigner-Seitz spheres of all the elements and we include
the spherical harmonics up to l = 2 (d orbitals) in constructing the basis functions.

Figure 1. Experimental phase diagram of the Ni-Al system.

Figure 2. Formation energies as a function of composition for all the compounds studied.

Table 1. LMTO results for cohesive energies and equilibrium molar volumes of different
structures in the Ni-Al system.

Structure     Composition   Cohesive energy   Lattice parameter
                            (kJ mol⁻¹)        (au)

FCC  A1       Ni            -584.231           6.52
     L1_2     Ni3Al         -586.732           6.675
     DO_22    Ni3Al         -584.074           6.685
     L1_0     NiAl          -548.698           6.828
     L1_2     NiAl3         -468.407           7.185
     DO_22    NiAl3         -469.580           7.146
     A1       Al            -400.794           7.522

BCC  A2       Ni            -581.185           5.188
     DO_3     Ni3Al         -585.104           5.270
     B2       NiAl          -568.112           5.413
     B32      NiAl          -535.162           5.426
     DO_3     NiAl3         -463.064           5.674
     A2       Al            -395.569           6.003

     D5_13    Ni2Al3        -536.019           7.500
     DO_20    NiAl3         -486.543          12.350

For each structure, the total energies provided by the LMTO method are obtained for different values of the volume; the minimum of this curve determines the equilibrium total energy and the equilibrium volume. Moreover, the bulk modulus, which is related to the curvature of the total energy with volume, is obtained using a fit based on Murnaghan's equation of state [43]. Table 1 lists the calculated equilibrium cohesive energies and molar volumes of all the phases considered. We recall that the cohesive energy is defined as the difference between the total energy of the phase and the total energy of the constituent atoms at infinite interatomic distances. The ground states of both Ni and Al are predicted to be FCC. The energy difference between the FCC and BCC structures, defined as $\Delta E_{\mathrm{FCC-BCC}} = E_{\mathrm{BCC}} - E_{\mathrm{FCC}}$, is 3.046 kJ mol^-1 and 5.224 kJ mol^-1 for Ni and Al respectively. The value of the structural energy difference for Al is very close to the one obtained by FLAPW calculations [15]. Figure 2 shows the formation energies as a function of composition for all the compounds studied. These formation energies are defined as

$\Delta E = E^{\phi} - x_{\mathrm{Ni}}\, E^{\mathrm{FCC}}(\mathrm{Ni}) - x_{\mathrm{Al}}\, E^{\mathrm{FCC}}(\mathrm{Al})$.   (10)

At the equiatomic composition, the calculated formation energies of the competing phases, i.e. the B2, B32 and L10 phases, strongly favour the B2 phase, in complete agreement with experimental data. The computed value of $\Delta E$, -75.6 kJ mol^-1, is more negative than the experimental value, -58.8 kJ mol^-1 [27]. However, our result agrees with other theoretical determinations [17], and this phase is known to contain anti-sites and vacancies even at the equiatomic composition, factors which are not considered in these calculations. For 75% Ni, the L12 phase is predicted to be more stable than the D03 phase, both being stable with regard to a mixture of the Ni-FCC and NiAl-B2 phases. The computed value of $\Delta E$, -48.36 kJ mol^-1, compares well with the experimental value, -41.0 kJ mol^-1. For 25% Ni, the L12 phase is again found to be more stable than the D03 phase, but its formation energy lies well above the line connecting the formation energies of the Al-FCC and NiAl-B2 phases. In fact, for this composition the most stable structure is the orthorhombic D020 phase, a result which is also obtained in our calculations. Its calculated formation energy, -39.89 kJ mol^-1, is in good agreement with the experimental value, -37.7 kJ mol^-1.

At 40% Ni, where the hexagonal D513 phase is experimentally reported to be stable over a small concentration range, there is no superstructure based on the BCC or FCC lattice. However, the D513 phase is predicted to be stable with regard to a mixture of the NiAl3-D020 and NiAl-B2 phases, since its calculated formation energy, -61.85 kJ mol^-1, which is close to the experimental one, -56.5 kJ mol^-1, lies just below the line connecting the formation energies of the D020 and B2 phases.
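The formation energies quoted in this subsection can be checked directly from the cohesive energies of Table 1 using equation (10). The short Python sketch below does this for the four compounds of the equilibrium diagram and also compares the D513 value with the tie line between D020 and B2. The function and variable names are illustrative choices, not part of the original work.

```python
# Cohesive energies (kJ per mole of atoms) copied from Table 1.
E_FCC_NI = -584.231
E_FCC_AL = -400.794

# label: (Ni concentration x_Ni, cohesive energy)
COMPOUNDS = {
    "NiAl3 (D020)": (0.25, -486.543),
    "Ni2Al3 (D513)": (0.40, -536.019),
    "NiAl (B2)": (0.50, -568.112),
    "Ni3Al (L12)": (0.75, -586.732),
}


def formation_energy(x_ni, e_coh):
    """Equation (10): energy relative to FCC Ni and FCC Al."""
    return e_coh - x_ni * E_FCC_NI - (1.0 - x_ni) * E_FCC_AL


def tie_line(x, left, right):
    """Energy of a two-phase mixture at composition x (linear interpolation)."""
    (x0, e0), (x1, e1) = left, right
    return e0 + (x - x0) * (e1 - e0) / (x1 - x0)


FORMATION = {name: formation_energy(x, e) for name, (x, e) in COMPOUNDS.items()}

# Stability of D513 against a mixture of D020 and B2 at x_Ni = 0.40.
MIXTURE = tie_line(
    0.40,
    (0.25, FORMATION["NiAl3 (D020)"]),
    (0.50, FORMATION["NiAl (B2)"]),
)
D513_MARGIN = FORMATION["Ni2Al3 (D513)"] - MIXTURE  # negative: D513 is stable

if __name__ == "__main__":
    for name, value in FORMATION.items():
        print(f"{name:15s} {value:8.2f} kJ/mol")
    print(f"D513 minus tie line: {D513_MARGIN:.2f} kJ/mol")
```

With the Table 1 entries, the D513 formation energy falls roughly 0.5 kJ mol^-1 below the D020-B2 tie line, which is a small margin compared with the differences between calculated and experimental values quoted above.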


3.2.  Cluster  interactions  and  disordered  alloys 

As  already  mentioned,  the  concentration-independent  multi-site  interactions  can  be 
extracted  from  the  total  energy  calculations  using  the  Connolly-Williams  inversion 
method. 
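As an illustration of the Connolly-Williams inversion, the sketch below solves the linear system E_s = sum over clusters of V_gamma times xi_gamma(s) for the five FCC structures A1 (Al), L12 (NiAl3), L10 (NiAl), L12 (Ni3Al) and A1 (Ni), using the tetrahedron set of clusters (empty, point, pair, triangle, tetrahedron). It rests on several assumptions that are not taken from the paper: the spin variable is +1 on Ni and -1 on Al, cluster multiplicities are folded into the V values, and the Table 1 cohesive energies are used at each structure's own equilibrium volume, whereas the calculations described here extract volume-dependent interactions. The numbers it produces are therefore only indicative.

```python
# (name, [xi_0, xi_1, xi_2, xi_3, xi_4], cohesive energy in kJ/mol from Table 1)
# Correlation functions of the nearest-neighbour tetrahedron clusters,
# assuming sigma = +1 on Ni sites and sigma = -1 on Al sites.
STRUCTURES = [
    ("Al (A1)",     [1.0, -1.0,  1.0,       -1.0,  1.0], -400.794),
    ("NiAl3 (L12)", [1.0, -0.5,  0.0,        0.5, -1.0], -468.407),
    ("NiAl (L10)",  [1.0,  0.0, -1.0 / 3.0,  0.0,  1.0], -548.698),
    ("Ni3Al (L12)", [1.0,  0.5,  0.0,       -0.5, -1.0], -586.732),
    ("Ni (A1)",     [1.0,  1.0,  1.0,        1.0,  1.0], -584.231),
]


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        known = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - known) / m[r][r]
    return x


CORRELATIONS = [corr for _, corr, _ in STRUCTURES]
ENERGIES = [energy for _, _, energy in STRUCTURES]
V = solve(CORRELATIONS, ENERGIES)  # V[0] ... V[4]


def cluster_energy(corr):
    """Energy of a configuration from its correlation functions."""
    return sum(v * xi for v, xi in zip(V, corr))


if __name__ == "__main__":
    for n, v in enumerate(V):
        print(f"V{n} = {v:9.3f} kJ/mol")
```

Because five structures and five clusters are used, the inversion reproduces the input energies exactly; the interest lies in the relative sizes of the interactions.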

Figures 3(a) and (b) present the cluster interactions obtained for the FCC and BCC lattices as a function of the volume. These results reveal a very small difference between the FCC and BCC interaction parameters. V0 and V1 display a strong volume dependence. The shape of V0 is similar to that of the E(V) function, since V0 is given by a sum of the energies of the different superstructures occurring on a given lattice. The shape of V1 is essentially due to the fact that the two alloying elements have different volumes. The strengths of V3 and V4 are much smaller than that of V2, indicating a reasonable degree of convergence already at the four-atom supercell level. At this level of discussion we recall that, in the tetrahedron approximation of the Connolly-Williams method for FCC-based structures, the L12 and D022 structures display the same correlations and are therefore degenerate. The LMTO results give $E_{\mathrm{L1_2}} - E_{\mathrm{D0_{22}}} = +1.17$ kJ mol^-1 for $x_{\mathrm{Ni}} = 0.25$ and $E_{\mathrm{L1_2}} - E_{\mathrm{D0_{22}}} = -2.66$ kJ mol^-1 for $x_{\mathrm{Ni}} = 0.75$. These two differences are small, which also indicates that the tetrahedron approximation is reasonable. It is interesting to compare the results obtained by LMTO calculations with the ones provided by the GPM-KKR-CPA [30] or TB-CBLM [22] approaches. This can be done by calculating the disordered energy and the effective pair interactions as a function of composition.

Figure 3. Cluster interactions as a function of the volume: (a) FCC lattice; (b) BCC lattice.

The disordered energy is obtained using the fact that the pair and higher order correlations are given as products of the point correlations for the totally disordered state. For the FCC and BCC-based structures, all atomic positions are equivalent and we have:

$\xi_{\gamma} = (\xi_1)^{n_{\gamma}}$   (11)

where $n_{\gamma}$ is the number of sites contained in the cluster $\gamma$. The energy of the disordered configuration is then given by

$E^{\mathrm{dis}} = \sum_{\gamma} V_{\gamma}\, (\xi_1)^{n_{\gamma}}$.   (12)
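Equations (11) and (12) are straightforward to evaluate once a set of cluster interactions is available. The sketch below is a minimal illustration under stated assumptions: the point correlation is taken as xi_1 = 2 x_Ni - 1 (spin +1 on Ni), and the five interaction values are illustrative numbers obtained by inverting the five FCC entries of Table 1 at their own equilibrium volumes, not the volume-dependent interactions used in the calculations described here.

```python
# Illustrative cluster interactions V0..V4 in kJ/mol (assumed values, see lead-in).
V = [-531.11, -105.02, 42.14, 13.30, -3.54]


def point_correlation(x_ni):
    """Point correlation for spin +1 on Ni and -1 on Al (assumed convention)."""
    return 2.0 * x_ni - 1.0


def disordered_energy(x_ni):
    """Equation (12): the n-site correlation of the random alloy is xi_1**n."""
    xi = point_correlation(x_ni)
    return sum(v * xi ** n for n, v in enumerate(V))


def mixing_energy(x_ni):
    """Disordered energy relative to the concentration-weighted pure elements."""
    reference = x_ni * disordered_energy(1.0) + (1.0 - x_ni) * disordered_energy(0.0)
    return disordered_energy(x_ni) - reference


if __name__ == "__main__":
    for i in range(11):
        x = i / 10.0
        print(f"x_Ni = {x:.1f}  mixing energy = {mixing_energy(x):7.2f} kJ/mol")
```

With these illustrative values the mixing energy of the random alloy is negative at all compositions and, because V3 is positive, its minimum is shifted towards the Ni-rich side.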

Comparing cluster interactions from the CVM and effective pair interactions from GPM or CBLM can be done using a resummation of the higher order cluster interactions at fixed concentration, corresponding to a lowest order expansion of the total energy in powers of the short-range parameters [28].

With the set of clusters used in our calculations, we obtain for the FCC lattice

$V_2^{\mathrm{eff}} = \tfrac{1}{4}\,(V_2 + 3 V_3\, \xi_1 + 6 V_4\, \xi_1^2)$   (13)

and for the BCC lattice:

$V_2^{\mathrm{eff}} = \tfrac{1}{4}\,(V_2 + 2 V_3\, \xi_1 + 4 V_4\, \xi_1^2)$   (14)

where the effective pair interactions $V_2^{\mathrm{eff}}$ are defined as

$V_2^{\mathrm{eff}} = \tfrac{1}{4}\,(V_{AA} + V_{BB} - 2 V_{AB})$.   (15)

$V_{AA}$, $V_{BB}$ and $V_{AB}$ are pair potentials evaluated at the separation between first-nearest neighbours for the FCC lattice and first- and second-nearest neighbours for the BCC lattice.

As already mentioned by Carlsson, the concentration dependence of the effective pair interaction (EPI) is given by clusters of higher order than the pair, here the triangle and the tetrahedron. Two distinct approaches are used to obtain the concentration-dependent EPI: (i) the 'locally relaxed' treatment, for which each cluster in the alloy is able to relax to its preferred lattice constant; (ii) the 'frozen lattice' treatment, where each cluster is calculated at a fixed lattice constant. We use the 'locally relaxed' treatment of the lattice relaxation effects, which is considered to be more accurate than the 'frozen lattice' treatment [28]. Figure 4 displays values of the first-nearest-neighbour interactions on the FCC lattice as a function of $\xi_1$ (with $\xi_1$ equal to $x_{\mathrm{Ni}} - x_{\mathrm{Al}}$). Our results are of course very similar to Carlsson's, since the augmented-spherical-wave (ASW) and LMTO methods used to calculate total energies are essentially the same. Tight-binding based results present the same concentration dependence [22], with a large positive value of $V_2^{\mathrm{eff}}$ at the Ni-rich end, consistent with the very strong ordering tendency in Ni3Al, which remains ordered up to its melting point; at the Al-rich end, the value of $V_2^{\mathrm{eff}}$ drops rapidly.

Our calculated EPI also display a semiquantitative agreement with the tight-binding ones, $V_2^{\mathrm{eff}}$(FCC) being equal to 2.84 kJ, 7.02 kJ and 10.34 kJ for $x_{\mathrm{Ni}}$ = 0.25, 0.5 and 0.75, respectively, while they are equal to 2.65 kJ, 4.58 kJ and 7.0 kJ in Colinet's approach. Another instructive comparison can be made with the EPI determined by Turchi et al [30] in Ni-Al alloys around the equiatomic composition. These authors found $V_2^{\mathrm{eff}}$(BCC) = 8.20 kJ for $x_{\mathrm{Ni}}$ = 0.5, compared to 9.97 kJ extracted from LMTO calculations or to 6.75 kJ obtained by Colinet et al [22].

In fact, Carlsson's resummation procedure corresponds to a high temperature expansion of the correlation functions and may be considered to be a good approximation to describe effective pair interactions in the liquid phase. For this kind of liquid alloy, experimental results based on neutron diffraction experiments [36, 44] have shown that A-A and B-B distances are quite similar in the alloys but different from the nearest distances in the pure liquids. Moreover, it has been found that the local coordination is roughly equal to 11, very close to the FCC coordination. Then, for each concentration, the disordered energy and the effective pair interaction used to describe the liquid are obtained from (12) and (13) at a fixed given volume. However, for each concentration a new volume is used. The concentration-dependent volume is taken to be similar to the one obtained for the FCC-based solid solution, which displays a similar variation to the experimental one seen in the liquid phase [45]. Another argument to justify this approximation is that, since the pair interactions of the FCC and BCC lattices come out similarly, they are not sensitive to the underlying lattice (at least for the Ni-Al system).

Figure 5 shows these values of the effective pair interaction as a function of the concentration. By comparison with the EPI obtained by the 'locally relaxed' treatment on the FCC and BCC lattices, we can see that these new values display a smoother variation as a function of the composition.

The same conclusion is reached from the comparison of the random energy obtained in the 'locally relaxed' treatment for the FCC and BCC lattices with the values obtained by this new treatment. From figures 4 and 5, we see that the three curves display negative values at all concentrations; the minimum is shifted towards the Ni-rich part for the two curves obtained by the 'locally relaxed' treatment. The third curve displayed in figure 5, which is used to describe the liquid phase, has a more symmetrical shape.


3.3.  Phase  diagram  calculations 

Let  us  sum  up  the  strategy  of  our  calculations. 

(i) The energies of the four compounds occurring in the phase diagram are obtained as a function of the volume. The minima of these curves are chosen to obtain the formation energies of these compounds. The entropies are considered to be equal to zero for the strictly stoichiometric compounds, i.e. the Al3Ni and Al3Ni2 phases. For the AlNi and AlNi3 phases, the configurational entropies are given by the tetrahedron approximation of the CVM for ordered phases based on the FCC or BCC lattices.

(ii) The two solid solutions based on the FCC and BCC lattices are described as short-range-order solutions using the CVM treatment in its tetrahedron approximation. The cluster interactions are obtained as a function of the volume using the Connolly-Williams approach.

(iii) The difference in energy between the FCC and BCC structures is taken from the LMTO results.

(iv) The liquid phase is also described as a short-range-order phase using the tetrahedron approximation of the CVM free energy in the disordered FCC structure. The effective pair interactions are obtained from Carlsson's resummation procedure, which corresponds to a high temperature expansion of the correlation functions. Although this model is developed with a solid-state approximation, it does a good job of describing the influence of chemical short-range order on the thermodynamic properties of the liquids [39, 46]. Of course, the continuous phase space of the nuclei's motion and structural information, such as pair correlation functions, are lost. However, the chemical ordering contributions to the entropy and the energy, which are quantities of central interest in our calculations, are only proportional to the concentration fluctuations. Using a statistical approach based on a 'discretized mesh' rather than the continuous phase space of the nuclei's motion may be considered as correct to describe the concentration fluctuations.

Figure 6. The Ni-Al phase diagram calculated with only FCC-based structures taken into account.

However, there is still a 'missing link' in our approach, which is the thermodynamic properties of the pure metals or, in other terms, the difference in free energy between the liquid and crystalline phases. Although density functional theory has made significant progress in the modelling of liquids, application to the determination of melting temperatures does not yet appear possible. Therefore, we have chosen to use thermodynamic compilations to obtain the melting temperatures $T_i^{\mathrm{FCC \to liq}}$ and the latent heats of melting $\Delta E_i^{\mathrm{FCC \to liq}}$ for both Ni and Al [47]. In this case the free energy of these elements in their liquid state is given by

$F_i^{\mathrm{liq}}(T) = \Delta E_i^{\mathrm{FCC \to liq}} - T\, \Delta E_i^{\mathrm{FCC \to liq}} / T_i^{\mathrm{FCC \to liq}}$.   (16)
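A minimal numerical sketch of this pure-element liquid free energy is given below, assuming the usual linear form in which the liquid free energy relative to the FCC solid is the latent heat multiplied by (1 - T/T_melt). The melting temperatures and latent heats used here are approximate handbook-type values inserted for illustration only; they are not necessarily the values of the compilation cited as [47].

```python
# Approximate melting data (assumed illustrative values, not taken from ref. [47]).
MELTING_DATA = {
    # element: (melting temperature in K, latent heat of melting in kJ/mol)
    "Ni": (1728.0, 17.5),
    "Al": (933.5, 10.7),
}


def liquid_free_energy(element, temperature):
    """Free energy of the pure liquid relative to the FCC solid (kJ/mol).

    Uses the linear form dE - T * dE / T_melt, which vanishes at the
    melting temperature and is negative above it.
    """
    t_melt, latent_heat = MELTING_DATA[element]
    return latent_heat - temperature * latent_heat / t_melt


if __name__ == "__main__":
    for element in ("Ni", "Al"):
        for temperature in (900.0, 1500.0, 2000.0):
            value = liquid_free_energy(element, temperature)
            print(f"{element} at {temperature:6.0f} K: {value:7.2f} kJ/mol")
```

In this form the liquid becomes the stable phase of each pure element exactly at its tabulated melting temperature, which is how the experimental melting data enter the otherwise first-principles scheme.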

If only FCC-based equilibria are considered, the phase diagram of figure 6 is produced. Of course, only the Ni-rich portion can be directly compared with experimental results, since for Ni concentrations less than 0.75 structures other than FCC-based superstructures appear. The main features of the diagram are a miscibility gap (MG) with a maximum temperature of 2630 K and two ordered phases, L12 and L10, with transition temperatures of 2X20 and 3135 K respectively. As already mentioned by Carlsson and Sanchez [48], the MG is caused by an elastic instability and can be understood through the concentration dependence of the calculated energy of mixing for the completely random FCC solid solution. $\Delta E_{\mathrm{rnd}}$ is negative at all concentrations, like the curves obtained with the 'locally relaxed' treatment (see figure 4), but here its curvature is negative for Ni concentrations less than 30%. This curvature is due to the peculiar volume dependence of V2, V3 and V4; another way to come to the same conclusion is to compute the phase diagram ignoring volume effects altogether. In that case the diagram, displayed in figure 7, strongly resembles the previous diagram except for the MG on the Al-rich side.

When the families of both FCC and BCC-based free energy curves are combined with the stoichiometric Al3Ni and Al3Ni2 compounds and the liquid phase, the phase diagram of figure 8 is obtained. All the main features of the experimental phase diagram (see figure 1) have, at least qualitatively, been reproduced. The quantitative agreement is not as good.

(i) The B2 phase is found to melt congruently at T = 2600 K, before undergoing a disordering reaction. This overestimate of the melting temperature seems to be attributable to the overestimate of the heat of formation of the B2 compound. However, in a more recent paper [49] Desai gives an experimental value of the heat of formation of the NiAl compound equal to -71.65 kJ mol^-1, which is very close to our calculated value. He mentions the difference between the experimental values but no explanation is provided. The origin of the discrepancy between the calculated and experimental phase diagrams in this region is therefore not clear. Let us recall that this compound is known to contain antisites and vacancies even at the equiatomic composition, factors which are not considered in these calculations.

(ii)  The  computed  phase  diagram  exhibits  two  eutectics,  one  at  the  Al-rich  side  and 
another  at  the  Ni-rich  side,  just  like  the  experimental  phase  diagram.  The  calculated 
eutectic  temperatures  are  in  relatively  good  agreement  with  the  experimental  ones  since 
they  have  been  found  equal  to  900  K  at  the  Al-rich  side  and  1680  K  at  the  Ni-rich  side 
(compared  with  913  K  and  1658  K  respectively). 

(iii) The two complex D020 and D513 structures are found to melt peritectically. For the Al3Ni compound, the peritectic temperature is some 27 K lower than experiment indicates, but for the Al3Ni2 compound it is 56 K higher than the experimental one.

Our calculations also lead to a peritectic decomposition of the Ni3Al compound. More particularly, the eutectic between the solid phases A1 and Ni3Al (L12) and the liquid takes place on the right of the peritectic, which is in agreement with the usually adopted phase diagrams. However, let us mention that this part of the phase diagram is very sensitive to the values of the energies of the three phases which determine the equilibrium properties. From an experimental point of view, we point out that the phase boundaries of the peritectic decomposition of the Ni3Al compound are still controversial.
4. Conclusion

It was shown that ab initio calculations of the Ni-Al phase diagram are, in many points, in agreement with experimental information. They have been obtained by combining total energy LMTO calculations with the tetrahedron CVM approximation. No relaxation and no vibrational effects have been taken into account. Only short-range order on the FCC and BCC lattices has been introduced, using the CVM treatment, in competition with the occurrence of complex phases like the D020 and D513 phases. The thermodynamic description of the liquid phase has been achieved by approximating this phase by an FCC-based disordered phase, to minimize the number of parameters. The melting temperature and latent heat of the pure components are the only two parameters which have been introduced from outside the first-principles framework. We believe that our results show that first-principles studies of phase equilibria may now be considered as feasible with a good degree of confidence.
Introduction

Recent interest has focused on the transition metal trialuminides as candidate phases in the microstructure of high-temperature superalloys. They have excellent oxidation resistance, and the ordered, intermetallic compounds should have good high-temperature strength, as do conventional cobalt and nickel based alloys. However, the compounds so far studied all suffer transgranular failure at room temperature, both those with cubic (L12) and non-cubic crystal structures (D022, D023 and D019). We wish to understand and predict firstly the crystal structure adopted by transition metal trialuminides, and secondly their susceptibility to cleavage fracture. Cottrell (1) has shown that the latter can be rationalised within Pugh's analysis (2) and finds that a ductile crystal will have a ratio of shear to bulk moduli μ/K < 0.4, while a brittle crystal has μ/K > 0.5. This has been confirmed for the L12 phases of Al3Ti and Al3Sc (3). The stable cubic transition metal trialuminides also tend to have a negative Cauchy pressure (c12 - c44), which indicates the effect of angular bonding forces over and above the usual radial forces associated with the metallic bond (4). In this paper, we shall use the results of first-principles calculations to show that, on the basis of these criteria, Al3Ru has a good likelihood of being the only ductile transition metal trialuminide.
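The two ductility indicators mentioned above are simple functions of the cubic elastic constants. The sketch below evaluates them for a cubic crystal; it assumes the Voigt average for the shear modulus, which is one common choice and is not stated in the text, and the elastic constants used in the example are hypothetical numbers, not results for any trialuminide.

```python
def bulk_modulus(c11, c12):
    """Bulk modulus of a cubic crystal."""
    return (c11 + 2.0 * c12) / 3.0


def voigt_shear_modulus(c11, c12, c44):
    """Voigt-averaged shear modulus of a cubic crystal (assumed averaging)."""
    return (c11 - c12 + 3.0 * c44) / 5.0


def cauchy_pressure(c12, c44):
    """Cauchy pressure c12 - c44; negative values suggest angular bonding."""
    return c12 - c44


def pugh_class(c11, c12, c44):
    """Classify by the shear-to-bulk ratio thresholds quoted from Cottrell."""
    ratio = voigt_shear_modulus(c11, c12, c44) / bulk_modulus(c11, c12)
    if ratio < 0.4:
        return "ductile"
    if ratio > 0.5:
        return "brittle"
    return "intermediate"


if __name__ == "__main__":
    # Hypothetical elastic constants in GPa, for illustration only.
    for c11, c12, c44 in [(200.0, 100.0, 50.0), (190.0, 60.0, 80.0)]:
        print(c11, c12, c44, pugh_class(c11, c12, c44), cauchy_pressure(c12, c44))
```

The second hypothetical set combines a large shear-to-bulk ratio with a negative Cauchy pressure, the combination associated in the text with brittle, directionally bonded compounds.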

The low temperature crystal structures of the transition metal trialuminides are shown in the lower part of fig. 1. The entry for Al3Ru is taken from present predictions; otherwise the data come from refs. 5 and 6. We see that whereas group 3, 4 and 5 metals form stable close-packed trialuminides, of the others only Al3Ru forms a close-packed phase. This is consistent with Carlsson's calculations of the three-body term in the Connolly-Williams expansion of the total energy (7). Carlsson found that this term oscillates across the 3d and 4d transition series, being negative in the neighbourhood of groups 3, 4, 7 and 8, and otherwise zero or positive. This corresponds with an observed skewing of the heats of formation away from regular solution behaviour (4, 8), stabilising the trialuminides of Sc, Ti, V and Ru, while, as seen in fig. 1, elsewhere in the transition series close-packed trialuminides are not observed.

We devote this paper to a theoretical study of Al3Ru, in the light of the above remarks. Very little is known about the Al-Ru system. The phase diagram is only tentatively known (6). The structure of Al3Ru is reported as D024 in Pearson's Handbook (5), with lattice parameters leading to an unphysically low atomic volume. Fortunately, the local density approximation (LDA) to density-functional theory is known to make accurate predictions of the crystal structures and elastic constants of crystals from first principles, and we exploit this using a basis of linear muffin-tin orbitals (LMTOs), in three distinct approaches (9, 10). (i) Using a full-potential, self-consistent method (11), we can obtain results corresponding to the LDA solution to Schrodinger's equation. (ii) In order to save computing time, we make the non-self-consistent Harris-Foulkes approximation when we are confident that the associated errors are acceptable. (iii) For predicting structural stability of close-packed phases, we use the atomic spheres approximation (ASA) to LMTO theory, using the structural energy difference theorem and rigid band approximation. This also allows us to construct a model to extract general principles arising from the electronic structure of transition metal trialuminides.

We  divide  the  paper  into  two  sections:  crystal  structure  and  elastic  constant  predictions.  A  final  two  sections 
contain  our  discussion  and  conclusions. 
In order to calculate energy differences between competing close-packed phases, we use the structural energy difference theorem associated with the atomic spheres approximation (9). Beginning with a self-consistent calculation of the density of states (DOS) and atomic sphere potentials in the L12 structure, we transfer this trial potential into another structure and calculate its DOS. From the two densities of states, we calculate the sums of occupied energy levels and, subtracting these, we obtain the structural energy difference correct to first order. It is instructive, first of all, to look at the DOS in Al3Ru (fig. 2), since this demonstrates all the principal features of chemical bonding in the transition metal trialuminides; these are sketched in fig. 3. At the bottom of the band the electrons have predominantly free-electron-like Al-s character. The three peaks are typical of the transition metal trialuminides. The lower and upper peaks are bonding and antibonding hybrid Al-p-Ru-d(t2g) states.† The central peak is predominantly nonbonding Ru-d(eg) (the peak is positioned just 0.05 Ry above the centre of the d-band, indicating very weak antibonding character). The lower peak also has a bonding Al-p-Ru-d(eg) contribution. In the transition metal trialuminides, the bonding levels are full and the antibonding levels empty, leading to strong, directional bonding between the transition metal and Al atoms. There is also metallic bonding mediated by the free-electron-like Al and transition metal s-electrons. In fig. 4, we show the DOS for L12 Al3Ti. Essentially the same features are present: the d-band is shifted to higher energy (the d-electrons become more tightly bound to the nucleus across the transition series) and the nonbonding states are empty since Ti has 4 fewer d-electrons than Ru. This leads one to the development of the rigid band model (12). Here we suppose that we may describe the electronic structure of the transition metal trialuminides using a rigid DOS, say, that of Al3Ru, and model all the transition metal trialuminides using this DOS, varying the Fermi energy to account for the total number of electrons.
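The rigid band procedure described above amounts to two numerical steps: filling a fixed DOS up to the Fermi energy that accommodates a given number of electrons, and comparing the sums of occupied levels between two structures at the same electron count. The sketch below illustrates these steps with two hypothetical model densities of states (a flat band and a linearly rising band, each holding two electrons); they stand in for the LMTO-ASA densities of states, which are not reproduced here, and all function names are illustrative.

```python
def integrated_dos(energies, dos):
    """Running number of states N(E) by the trapezoid rule."""
    total = [0.0]
    for i in range(1, len(energies)):
        step = energies[i] - energies[i - 1]
        total.append(total[-1] + 0.5 * (dos[i] + dos[i - 1]) * step)
    return total


def fermi_energy(energies, dos, n_electrons):
    """Energy at which the integrated DOS equals n_electrons."""
    n_of_e = integrated_dos(energies, dos)
    if not 0.0 <= n_electrons <= n_of_e[-1]:
        raise ValueError("electron count outside the range of the band")
    for i in range(1, len(energies)):
        if n_of_e[i] >= n_electrons:
            span = n_of_e[i] - n_of_e[i - 1]
            frac = 0.0 if span == 0.0 else (n_electrons - n_of_e[i - 1]) / span
            return energies[i - 1] + frac * (energies[i] - energies[i - 1])
    return energies[-1]


def band_energy(energies, dos, e_fermi):
    """Sum of occupied levels: integral of E * g(E) up to the Fermi energy."""
    total = 0.0
    for i in range(1, len(energies)):
        lo, hi = energies[i - 1], min(energies[i], e_fermi)
        if hi <= lo:
            break
        t = (hi - lo) / (energies[i] - energies[i - 1])
        g_lo = dos[i - 1]
        g_hi = dos[i - 1] + t * (dos[i] - dos[i - 1])
        total += 0.5 * (lo * g_lo + hi * g_hi) * (hi - lo)
    return total


def structural_energy_difference(energies, dos_a, dos_b, n_electrons):
    """Band energy of structure B minus structure A at fixed electron count."""
    e_a = band_energy(energies, dos_a, fermi_energy(energies, dos_a, n_electrons))
    e_b = band_energy(energies, dos_b, fermi_energy(energies, dos_b, n_electrons))
    return e_b - e_a


# Hypothetical model bands on a common energy grid (arbitrary units).
GRID = [i / 100.0 for i in range(101)]
FLAT_DOS = [2.0 for _ in GRID]        # holds 2 electrons
RISING_DOS = [4.0 * e for e in GRID]  # also holds 2 electrons

if __name__ == "__main__":
    for n in (0.5, 1.0, 1.5):
        diff = structural_energy_difference(GRID, FLAT_DOS, RISING_DOS, n)
        print(f"N = {n:.1f}: band energy difference = {diff:.4f}")
```

Scanning the electron count with the two densities of states held fixed is the rigid band analogue of plotting the structural energy difference against electron concentration.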

Now  consider  the  upper  part  of  fig.  1.  Using  the  DOS 


Q  LI2  O  OO^g  O  DO23  X  DOn  luminides,  the  bonding  levels  are  full  and  the  antibonding 
(DO22  axial  ratio  shown)  levels  empty  leading  to  strong,  directional  bonding  between 

the  transition  metal  and  A1  atoms.  There  is  also  metallic 
Fig.l  t  The  laxoeT  Hurt  shows  (host  transition  met-  bonding  mitigated  by  the  iree-election-Uke  A1  and  transi- 
als  which  form  trialuminides  with  a  symbol  accord-  tion  metal  s-electrons.  In  fig.  4,  we  show  the  DOS  for  LI2 
ing  to  the  legend  given.  The  upper  part  shows  co/-  Al3Ti.  Elssentially  the  same  features  are  present:  the  d- 
culated  structural  energy  differences  (ImRy/atom  =  band  is  shifted  to  higher  energy  (the  d-electrons  become 
1.2s  kJ/g-atom).  The  solid  curves  show  the  energies  more  tightly  bound  to  the  nucleus  across  the  transition  se- 
0/  non-cubic  trialuminides  relative  to  LIg  as  a  func-  lies)  and  the  nonbonding  states  are  empty  since  Ti  has  4 
tion  of  the  electron  concentration  corresponding  to  the  fewer  d-electrons  than  Ru.  This  leads  one  to  the  develop- 
(ransifion  metals  in  the  lower  part.  The  rigid  band  ment  of  the  rigid  band  model  (12).  Here  we  suppose  that 
approximation  uses  the  DOS  of  Al^Ru  in  the  differ-  we  may  describe  the  electronic  structure  of  the  transition 
ent  structures,  varying  eja  while  keeping  the  DOS  un-  metal  trialuminides  using  a  rigid  DOS,  say,  that  of  AI3RU; 
changed,  so  that  at  the  vertical  broken  line  the  calcula-  and  model  all  the  transition  metal  trialuminides  using  this 
tions  are  not  approximated.  The  dotted  line  ^ows  the  DOS,  varying  the  Fermi  energy  to  account  for  the  total 
DO22-LI2  energy  difference  using  the  Ti  DOS.  Us-  number  of  electrons. 

ing  this  DOS  we  can  predict  the  structure  of  Al^Ru.  Now  consider  the  upper  part  of  fig.  1.  Using  the  DOS 

of  AI3RU  in  four  close-packed  structures  obtained  from  the 
trial  potential  of  the  LI2  phase,  we  have  calculated  the  ener^es  of  DO22,  ^^4  ^10  (*U  ratios) 

relative  to  LI  3.  These  are  shown  at  the  intersection  of  the  solid  curves  with  the  vertical  broken  line.  In  the  spirit 
of  the  rigid  bud  approximation,  we  use  the  AI3RU  DOS  to  infer  the  energy  ordering  of  all  other  transition  metal 
trialuminides  which  results  in  the  solid  curves  of  energy  difference  against  dectron  concentration  e/a.  Very 
much  the  same  curves  are  obtained  if  we  use  the  Al3Ti  DOS  from  fig.  4.  For  example,  in  fig.  1  the  broken  curve 
shows  the  DO32-LI2  energy  difference  obtained  in  this  way.  It  is  remarkable  that  using  the  electronic  structure 
of  either  Al3Ti  or  AI3RU,  one  can  determine  trends  in  the  structural  stability  of  all  the  other  transition  metal 
trialuminides.  Our  calculations  shown  in  fig.  1  are  for  structures  with  ideal  axial  ratio.  Therefore  our  nradel  is 
not  able  to  predict  the  structures  of  the  group  3  and  4  trialuminides  (apart  from  AI3SC).  These  have  greater  than 
10%  distortions  from  ideal  axial  ratio.  As  we  see  from  fig  1,  it  is  this  dist(»tion  that  stabilises  the  observed  phase; 

tThe  cubic  symmetry  has  split  the  five  degenerate  atomic  (/-states  into  sub-bands  with  eg  (x^  -  p*,  3z^  -  1) 
and  ijg  (xy,  yr,  *x)  symmetry. 
Fig. 2: LMTO-ASA density of states of L12 Al3Ru. A vertical line shows the position of the Fermi energy
which separates occupied from unoccupied energy levels. The lower panels show the DOS decomposed into
orbitals to demonstrate the principal features of the chemical bonding (see the text, and fig. 3).


Fig. 3: A schematic drawing of the important features of the DOSs shown in fig. 2. See the text for details.


Fig. 4: Density of states in L12 Al3Ti. The Fermi energy is indicated as in fig. 2.


this is already known in the case of Al3Ti from total energy calculations (4, 9). Very importantly, we see that
while the group 4 trialuminides are on the borderline between L12 and DO22 and can be made cubic by alloying
(13), the group 5 compounds cannot (14).
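
The rigid band argument used above can be sketched in a few lines: one DOS per structure is held fixed, filled to a chosen electron count, and the occupied band energies are compared. The sketch below is only an illustration under stated assumptions. The two histogram DOS (`FLAT`, `GAPPED`) are hypothetical toy models in arbitrary units, not the LMTO-ASA densities of states of figs. 2 and 4, and the function names are invented for this example.

```python
def fill_bands(bin_edges, dos, n_electrons):
    """Fill a histogram DOS with n_electrons.

    bin_edges has len(dos) + 1 entries; dos[i] is the constant density of
    states between bin_edges[i] and bin_edges[i + 1].
    Returns (fermi_energy, band_energy), where band_energy is the integral
    of E * n(E) over the occupied states.
    """
    filled = 0.0
    band_energy = 0.0
    for lo, hi, g in zip(bin_edges[:-1], bin_edges[1:], dos):
        capacity = g * (hi - lo)
        if capacity > 0.0 and filled + capacity >= n_electrons:
            # The Fermi level falls inside this bin: fill it only partly.
            fermi = lo + (n_electrons - filled) / g
            band_energy += 0.5 * g * (fermi ** 2 - lo ** 2)
            return fermi, band_energy
        filled += capacity
        band_energy += 0.5 * g * (hi ** 2 - lo ** 2)
    raise ValueError("DOS holds fewer states than n_electrons")


def rigid_band_difference(structure_a, structure_b, n_electrons):
    """Band-energy difference U_a - U_b at a common electron count."""
    _, u_a = fill_bands(*structure_a, n_electrons)
    _, u_b = fill_bands(*structure_b, n_electrons)
    return u_a - u_b


# Hypothetical toy DOS (arbitrary units), NOT results from the paper.
FLAT = ([0.0, 1.0], [2.0])                              # one featureless band
GAPPED = ([-0.25, 0.25, 0.75, 1.25], [2.0, 0.0, 2.0])   # two sub-bands and a gap

# Energy difference against band filling, the analogue of a curve in fig. 1.
curve = [(n / 4, rigid_band_difference(GAPPED, FLAT, n / 4)) for n in range(1, 9)]
```

In this toy model the gapped structure is favoured when the Fermi level sits in the gap, and the difference vanishes when both bands are full. The same bookkeeping applied to calculated DOS gives curves of energy difference against e/a.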

Our calculations show that Al3Ru has the DO22 structure, not DO24 as reported (5).‡ Furthermore, we see a
very wide region of DO22 stability around this electron concentration, so we regard it as highly improbable that
Al3Ru can be made cubic by alloying.

Elastic constant calculations

We need to make the most accurate total energy calculations to calculate elastic constants (11). Not only the
band terms, as in the structural energy difference theorem, but also electrostatic and exchange-correlation terms
must be included since these do not cancel in energy differences to second order. For shear modulus calculations
we make a series of calculations of the total energy at a fixed amount of a pure strain. We vary the strain in
eleven evenly-spaced increments in the range ±0.05 to obtain the total energy as a function of strain. The second
order terms are extracted by least-squares fitting to polynomials. Very stringent convergence is required since the
energy varies by typically only a few mRy over the whole range of distortions, and great care is needed if one is
to obtain a smooth fit to the data. Typically, our self-consistent energies are converged to better than 0.01 mRy,


‡We should emphasise that our model is strictly concerned with trialuminides. We cannot make any statements
concerning phases such as Al13Fe4 which has nearly the same stoichiometry and is sometimes referred to as Al3Fe.
A similar phase in the Al-Ru system may also exist.
and we employ a modified tetrahedron method of Brillouin zone integration (15) using a uniform mesh of at least
18^3 k-points in the whole zone. The bulk modulus is calculated from nine separate dilatations in the range ±0.2;
since the energy changes by much larger amounts, convergence is less critical here. We divide the presentation of
our results into those for L12 and DO22 structures.
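
The fitting step described above (eleven evenly spaced strains in the range ±0.05, a least-squares polynomial, and extraction of the second order term) can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure actually coded for the paper. The energy function `toy_energy` and its coefficients are hypothetical stand-ins for total-energy calculations, the polynomial degree is an arbitrary choice, and the conversion of the second order coefficient to an elastic constant (which depends on the cell volume and the strain mode) is left out.

```python
def polyfit_least_squares(x, y, degree):
    """Least-squares polynomial coefficients [a0, a1, ...] from the normal equations."""
    n = degree + 1
    a = [[sum(xi ** (i + j) for xi in x) for j in range(n)] for i in range(n)]
    b = [sum(yi * xi ** i for xi, yi in zip(x, y)) for i in range(n)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for r in range(col + 1, n):
            f = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= f * a[col][c]
            b[r] -= f * b[col]
    # Back substitution.
    coeffs = [0.0] * n
    for i in reversed(range(n)):
        s = sum(a[i][j] * coeffs[j] for j in range(i + 1, n))
        coeffs[i] = (b[i] - s) / a[i][i]
    return coeffs


DELTA_MAX = 0.05
strains = [-DELTA_MAX + i * (2 * DELTA_MAX / 10) for i in range(11)]


def toy_energy(delta):
    """Hypothetical total energy in mRy; varies by about 1 mRy over the range."""
    return -1234.0 + 400.0 * delta ** 2 + 900.0 * delta ** 3


energies = [toy_energy(d) for d in strains]

# Fit in the scaled variable t = delta / DELTA_MAX to keep the normal
# equations well conditioned, then convert the t^2 coefficient back.
scaled = [d / DELTA_MAX for d in strains]
coeffs = polyfit_least_squares(scaled, energies, degree=4)
second_order = coeffs[2] / DELTA_MAX ** 2   # coefficient of delta^2, in mRy
```

Because the whole energy variation is of order 1 mRy, noise of 0.01 mRy in the individual energies is already a percent-level perturbation of the fitted curvature, which is why the convergence requirement is so strict.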

L12 structure


Although, as we have shown, Al3Ru is never likely to exist in the L12 structure, it is instructive at first
to examine its three elastic constants in that phase. Conveniently, these are the bulk modulus
K = (1/3)(c11 + 2c12), and the two shear constants C' = (1/2)(c11 - c12) and c44. In table 1, we show these,
the Cauchy pressure and the ratio μ2/K, where μ2 = 3c44(c11 - c12)/(4c44 + c11 - c12) is the reciprocal of the
elastic compliance of a cubic crystal orientated for [112] or [110] slip on (111) planes (16). We also show
the same quantities we have calculated for Al3Ti. We see that Al3Ru differs significantly from Al3Ti, in that
it satisfies the criteria for ductility discussed in the introduction, namely a positive Cauchy pressure and
μ/K < 0.4.
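
As a worked check of these definitions, the short sketch below recovers c11 and c12 from K and C', then evaluates the Cauchy pressure c12 - c44 and the ratio μ2/K. It is an illustration only. The function name is invented here, and the input numbers are the table 1 entries read under the assumption that the first column of the table is c44 and the second is C' (this reading reproduces the tabulated Cauchy pressures and μ2/K values to within rounding).

```python
def cubic_ductility_indicators(c44, c_prime, bulk_modulus):
    """Cauchy pressure and mu2/K for a cubic crystal (all inputs in GPa)."""
    # Invert K = (c11 + 2*c12)/3 and C' = (c11 - c12)/2.
    c11 = bulk_modulus + 4.0 * c_prime / 3.0
    c12 = bulk_modulus - 2.0 * c_prime / 3.0
    cauchy_pressure = c12 - c44
    mu2 = 3.0 * c44 * (c11 - c12) / (4.0 * c44 + c11 - c12)
    ratio = mu2 / bulk_modulus
    return {
        "c11": c11,
        "c12": c12,
        "cauchy_pressure": cauchy_pressure,
        "mu2_over_K": ratio,
        # The two criteria quoted in the text.
        "meets_criteria": cauchy_pressure > 0.0 and ratio < 0.4,
    }


# Table 1 values, assuming the column order c44, C', K.
al3ru = cubic_ductility_indicators(c44=103.0, c_prime=38.0, bulk_modulus=160.0)
al3ti = cubic_ductility_indicators(c44=87.0, c_prime=68.0, bulk_modulus=117.0)
```

With these inputs Al3Ru passes both criteria and Al3Ti fails both, in line with the comparison made in the text.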


TABLE 1

Calculated Elastic Constants of L12 Al3Ru and Al3Ti

             c44      C'       K     c12-c44    μ2/K
           ------------- GPa --------------
Al3Ru        103      38      160       31      0.30
            (101)    (35)    (160)
Al3Ti         87      68      117      -15      0.63

DO22 structure


In table 1, we have also shown in parentheses the elastic constants for Al3Ru calculated non self-consistently
using the Harris-Foulkes approximation (0, 10). There is clearly negligible error involved, while the saving in
computer time is considerable. For calculations in the DO22 phase of Al3Ru, we continue to use the non
self-consistent approach. We show our calculated lattice constants in table 2; and in table 3, calculated
elastic constants of Al3Ru, which we compare with the measured elastic constants of Al3Ti (17).

In table 3, the last four columns show firstly the two Cauchy discrepancies, and secondly the μ/K ratios for
the known slip systems in DO22 alloys. μ1 = c44 is the reciprocal compliance in ½a[110](001) slip; and μ2,
the same expression as given above for cubic crystals, is for a[110] slip on the close-packed (112) planes.
The former slip system is operative in DO22 Al3Ti, while the latter is observed in Al3V (18).

TABLE 2

Lattice Parameters of DO22 Al3Ru and Al3Ti

              a (nm)         c/a
Al3Ru(a)      0.381          2.01
Al3Ti(b)      [illegible]    2.23

(a) Calculated in the present work
(b) Calculated in ref. (9)

TABLE 3

Elastic Constants of DO22 Al3Ru and Al3Ti

              K    c11   c33   c12   c13   c44   c66   c13-c44   c12-c66   μ1/K   μ2/K
            ---------------------------- GPa ----------------------------
Al3Ru(a)     164   236   259   143   115   106    89       9        54     0.65   0.35
Al3Ti(b)     106   218   218    58    46    92   117     -46       -59     0.87   0.79

(a) Calculated in the present work
(b) Measured in ref. (17)
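
The derived columns of table 3 follow directly from the primary constants, and the short sketch below recomputes them as a consistency check. It is an illustration, not part of the original calculations. The class and method names are invented here, and the formulas assumed are the Cauchy discrepancies c13 - c44 and c12 - c66, μ1 = c44, and μ2 = 3c44(c11 - c12)/(4c44 + c11 - c12), as described in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TetragonalConstants:
    """Bulk modulus and elastic constants of a tetragonal crystal, in GPa."""
    K: float
    c11: float
    c33: float
    c12: float
    c13: float
    c44: float
    c66: float

    def cauchy_discrepancies(self):
        """Return (c13 - c44, c12 - c66)."""
        return self.c13 - self.c44, self.c12 - self.c66

    def mu1_over_K(self):
        """mu1 = c44, the modulus quoted for [110](001) slip."""
        return self.c44 / self.K

    def mu2_over_K(self):
        """mu2 from the cubic expression, quoted for [110] slip on (112)."""
        shear = self.c11 - self.c12
        mu2 = 3.0 * self.c44 * shear / (4.0 * self.c44 + shear)
        return mu2 / self.K


# Values copied from table 3.
AL3RU = TetragonalConstants(K=164, c11=236, c33=259, c12=143, c13=115, c44=106, c66=89)
AL3TI = TetragonalConstants(K=106, c11=218, c33=218, c12=58, c13=46, c44=92, c66=117)
```

Both Cauchy discrepancies are positive for Al3Ru and negative for Al3Ti, and only the μ2/K ratio of Al3Ru falls below the 0.4 threshold used earlier for the cubic phase.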


Discussion

We need to answer the question, do our calculations indicate that Al3Ru deserves some experimental attention?
It is usually supposed that only cubic intermetallics will have sufficient room temperature ductility to be useful,
and we have seen that it is most unlikely that an L12 form of Al3Ru can be stabilised by alloying. Two points are at
issue here: lack of ductility due to violation of the Von Mises criterion, and intrinsic lack of cleavage strength.
The slip system a[110](112) favoured by Al3V (and, we expect, also Al3Ru) has only four, rather than the five
independent modes necessary for general ductility; this may however be sufficient, particularly if augmented with
basal slip for which the μ/K ratio is more favourable in Al3Ru than Al3Ti, which is seen to have a very unfavourable
μ/K ratio on its chosen slip system. This would explain the brittleness of Al3Ti, although we don't yet understand
what determines the choice of slip system in the transition metal trialuminides. Our calculations show significant
differences between the elastic properties of Al3Ru and Al3Ti. The negative Cauchy discrepancies in both L12
and DO22 Al3Ti indicate large angular forces and low bond mobility. In Al3Ru, although angular forces are still
expected to play an important role (8), the elastic constants in L12 are rather similar to those of the ductile
crystal γ'-Ni3Al. It is important, also, to note that whereas all the known DO22 and DO23 transition metal
trialuminides have axial ratios 10-15% larger than ideal, Al3Ru is predicted to have an axial ratio very nearly
ideal, so that the (112) planes are genuinely close-packed. These observations lead us to expect that a ductile
DO22 alloy of Al3Ru, possibly with ternary additions, can be found.
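
The Von Mises criterion invoked above is a statement about linear independence: each slip system contributes a traceless symmetric strain, and general ductility needs five such strains that are linearly independent. A minimal sketch of that count is given below. It is illustrated on the familiar cubic {111}<110> family and on a pair of (001) systems, not on the DO22 systems discussed here, whose full enumeration is not given in the text. All function and variable names are invented for this example.

```python
from itertools import product


def slip_strain(normal, direction):
    """Symmetric strain of one slip system as (xx, yy, zz, yz, xz, xy)."""
    n, b = normal, direction
    e = [[0.5 * (b[i] * n[j] + b[j] * n[i]) for j in range(3)] for i in range(3)]
    return [e[0][0], e[1][1], e[2][2], e[1][2], e[0][2], e[0][1]]


def matrix_rank(rows, tol=1e-9):
    """Rank of a small dense matrix by Gaussian elimination."""
    rows = [list(r) for r in rows]
    found = 0
    for col in range(len(rows[0])):
        pivot = next(
            (r for r in range(found, len(rows)) if abs(rows[r][col]) > tol), None
        )
        if pivot is None:
            continue
        rows[found], rows[pivot] = rows[pivot], rows[found]
        for r in range(found + 1, len(rows)):
            f = rows[r][col] / rows[found][col]
            rows[r] = [a - f * p for a, p in zip(rows[r], rows[found])]
        found += 1
    return found


def independent_slip_systems(systems):
    """Number of linearly independent slip strains in a list of (normal, direction)."""
    return matrix_rank([slip_strain(n, b) for n, b in systems])


NORMALS_111 = [(1, 1, 1), (-1, 1, 1), (1, -1, 1), (1, 1, -1)]
DIRECTIONS_110 = [(1, 1, 0), (1, -1, 0), (1, 0, 1), (1, 0, -1), (0, 1, 1), (0, 1, -1)]

# Keep only directions lying in the slip plane (direction . normal = 0).
octahedral = [
    (n, b)
    for n, b in product(NORMALS_111, DIRECTIONS_110)
    if sum(x * y for x, y in zip(n, b)) == 0
]
basal = [((0, 0, 1), (1, 1, 0)), ((0, 0, 1), (1, -1, 0))]
```

For the twelve cubic {111}<110> systems the count is five, so the criterion is met; the two (001) systems alone give two. The same function can be applied to any proposed list of slip systems to see how many independent modes it supplies.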

An important influence will be the anti-phase boundary (APB) energy on (112) planes. If this is low in Al3Ru
compared with Al3V, then partial dislocations with half the Burgers vector of those observed in Al3V will be
responsible for the deformation, with a consequent enhancement of ductility. In future work, we will concentrate
on a comparison of elastic constants and APB energies in Al3Ru and Al3V. This will give a better indication of
the promise of Al3Ru as a ductile trialuminide, and also assist our understanding of the choice of slip and twinning
systems in the transition metal trialuminides.

Conclusions 

By taking a flexible approach to density-functional theory in the local density approximation, we can make
models to infer general trends, and highly accurate calculations to predict specific physical properties of
intermetallic compounds.

We have applied this to the transition metal trialuminides to get a picture of their chemical bonding and to
see general trends in their phase stability, and also to take a detailed look at the lattice and elastic constants of
Al3Ru. We have demonstrated the remarkable fact that, using the structural energy difference theorem and rigid
band model, one can predict the energy ordering in Al3Ru using the electronic structure of Al3Ti, and vice versa.
The results of the rigid band calculations are also consistent with the trends in the phase stability of the transition
metal trialuminides, given that axial ratio distortions in the non cubic phases are unaccounted for in our model.
Accurate, full-potential calculations indicate distinct differences between Al3Ru and Al3Ti. We expect that Al3Ru
will deform similarly to Al3V, but may well have enhanced ductility since it has ideal axial ratio. Differences in
APB energy on the close-packed planes will be investigated in future work to confirm or deny this. Unlike other
transition metal trialuminides, Al3Ru shows some promise of room temperature ductility, even though it, and its
ternary alloys, are unlikely to have a structure other than DO22.